Regularized Linear Function Spaces (\(\ell_2-\ell_2\))
We continue to work with linear function classes, but in contrast with the previous section Sparse Linear Function Spaces (\ell_1-\ell_1), the learner and adversary function spaces are equipped with the \(\ell_2\)-norm. This difference will translate to modifying \(R_{\min}\) and \(R_{\max}\) in Proposition 17, given that the dual norm for the spaces \(\Theta\) and \(W\) in this setting is again the \(\ell_2\)-norm. In particular, for the sequential estimators we will take
and
for the joint estimator, since these regularizers are 1-strongly convex in their respective domains. In these cases, the updates will be essentially optimistic gradient descent. In the following subsections we only state the corresponding Lemmas analogous to section Sparse Linear Function Spaces (\ell_1-\ell_1).
Estimator 1
FTRL Iterates for Estimator 1
Consider the iterates for \(t=1,\ldots, T\):
with \(\tilde{\alpha}_{-1} = \tilde{\alpha}_{0}=\tilde{\theta}_{1,-1}=\tilde{\theta}_{1,0} = 0\), and \(\eta = \frac{1}{8\|\mathbb{E}_n[aa^\top]\|_2}\).
Then, \(\bar{\alpha} = \frac{1}{T}\sum_{t=1}^{T}\alpha_t\), is a \(O(T^{-1})\)-approximate solution to
|
Sparse Linear NPIV estimator using \(\ell_2-\ell_2\) optimization. |
Estimator 2
FTRL Iterates for Estimator 2
Consider the iterates for \(t=1,\ldots, T\):
with \(\tilde{\alpha}_{-1} = \tilde{\alpha}_{0}=\tilde{\theta}_{1,-1}=\tilde{\theta}_{1,0} = 0\), and \(\eta = \frac{1}{8\|\mathbb{E}_n[aa^\top]\|_2}\).
Then, \(\bar{\alpha} = \frac{1}{T}\sum_{t=1}^{T}\alpha_t\), is a \(O(T^{-1})\)-approximate solution to
Sparse Ridge NPIV estimator using \(\ell_2-\ell_2\) optimization. |
Estimator 3 - (Ridge)
FTRL Iterates for Estimator 3 (Ridge)
Consider the iterates for \(t=1,\ldots, T\):
with \(\tilde{\alpha}_{-1} = \tilde{\alpha}_{0} = \tilde{\beta}_{-1} = \tilde{\beta}_{0}= \theta_{1,-1}=\theta_{1,0} = \theta_{2,-1}=\theta_{2,0}= 0\), and \(\eta = [16\max\left\{\left\|\mathbb{E}_n[ac'^\top]\right\|_2, \left\|\mathbb{E}_n[ac^\top]\right\|_2, \left\| \mathbb{E}_n[bc^\top]\right\|_2\right\}]^{-1}\).
Then,
are a \(O(T^{-1})\)-approximate solution for
|
Sparse Ridge NPIV estimator using \(\ell_2-\ell_2\) optimization for nested NPIV. |
Estimator 3 - (\(\ell_2\)-norm)
FTRL Iterates for Estimator 3 - (\(\ell_2\)-norm)
Consider the iterates for \(t=1,\ldots, T\):
with \(\tilde{\alpha}_{-1} = \tilde{\alpha}_{0} = \tilde{\beta}_{-1} = \tilde{\beta}_{0}= \theta_{1,-1}=\theta_{1,0} = \theta_{2,-1}=\theta_{2,0}= 0\), and \(\eta = [16\max\left\{\left\|\mathbb{E}_n[ac'^\top]\right\|_2, \left\|\mathbb{E}_n[ac^\top]\right\|_2, \left\| \mathbb{E}_n[bc^\top]\right\|_2\right\}]^{-1}\).
Then,
are a \(O(T^{-1})\)-approximate solution for
|
Sparse Linear NPIV estimator using \(\ell_2-\ell_2\) optimization for nested NPIV. |