๐ŸŽ‰ Our paper SWAN is accepted at ICML 2025!

Apr 1, 2025ยท
Wenbo Gong
Wenbo Gong
ยท 1 min read

๐Ÿฅณ๐Ÿฅ‚ Our paper SWAN:SGD with Normalization and Whitening Enables Stateless LLM Training has been accepted in ICML 2025 conference. This optimizer allows stateless LLM training to maximize the memory efficiency, while achieving on-par or better performance than standard AdamW optimizer. ๐Ÿฅณ๐Ÿฅ‚

Wenbo Gong
Authors
Senior Researcher
Senior Researcher at Microsoft Research Cambridge working on learning dynamics and optimization for foundation models, with prior work on causality and approximate inference.