Context & scope

Last lecture:

This lecture will cover:

Learning from Expert Advice

One of the most fundamental problems in ML (and arguably more broadly in CS):

<aside> 🚧 The Experts problem:

For $t=1,2,\ldots,T$:

  1. learner choses one of $N$ experts, $i_t \in [N]$
  2. experts losses are revealed: $\ell_t \in [0,1]^N$
  3. player incurs loss of chosen expert, $\ell_t(i_t)$

Learner’s goal is to minimize regret compared to best expert in hindsight:

$$ R_T = \sum_{t=1}^T \ell_t(i_t) - \min_{i^* \in [N]} \sum_{t=1}^T \ell_t(i^*) $$

</aside>