Recap & context

Online Convex Optimization

<aside> đźš§ Online convex optimization (OCO)

For $t=1,2,\ldots,T$:

  1. player/learner choses decision $w_t \in W$
  2. adversary chooses convex loss function $f_t : W \mapsto \R$
  3. player incurs loss $f_t(w_t)$ and observes $f_t$ as feedback

Player’s goal is to minimize regret:

$$ R_T = \sum_{t=1}^T f_t(w_t) - \min_{w^* \in W} \sum_{t=1}^T f_t(w^*) $$

</aside>