๐ Our paper SWAN is accepted at ICML 2025!
Apr 1, 2025ยท
ยท
1 min read
Wenbo Gong

๐ฅณ๐ฅ Our paper SWAN:SGD with Normalization and Whitening Enables Stateless LLM Training has been accepted in ICML 2025 conference. This optimizer allows stateless LLM training to maximize the memory efficiency, while achieving on-par or better performance than standard AdamW optimizer. ๐ฅณ๐ฅ