Skip to yearly menu bar Skip to main content


Poster

Fully Unconstrained Online Learning

Ashok Cutkosky · Zak Mhammedi

[ ]
Wed 11 Dec 4:30 p.m. PST — 7:30 p.m. PST

Abstract: We provide a technique for OLO that obtains regret $G\|w_\star\|\sqrt{T\log(\|w_\star\|G\sqrt{T})} + \|w_\star\|^2 + G^2$ on $G$-Lipschitz losses for any comparison point $w_\star$ without knowing either $G$ or $\|w_\star\|$. Importantly, this matches the optimal bound $G\|w_\star\|\sqrt{T}$ available with such knowledge (up to logarithmic factors), unless either $\|w_\star\|$ or $G$ is so large that even $G\|w_\star\|\sqrt{T}$ is roughly linear in $T$. Thus, it matches the optimal bound in all cases in which one can achieve sublinear regret, which arguably encompasses all "interesting" scenarios.

Live content is unavailable. Log in and register to view live content