Part of Advances in Neural Information Processing Systems 37 (NeurIPS 2024) Main Conference Track
Ashok Cutkosky, Zakaria Mhammedi
We provide a technique for OLO that obtains regret G‖ on G-Lipschitz losses for any comparison point w_\star without knowing either G or \|w_\star\|. Importantly, this matches the optimal bound G\|w_\star\|\sqrt{T} available with such knowledge (up to logarithmic factors), unless either \|w_\star\| or G is so large that even G\|w_\star\|\sqrt{T} is roughly linear in T. Thus, at a high level it matches the optimal bound in all cases in which one can achieve sublinear regret.