Towards Efficient Optimizer Design for LLM via Structured Fisher Approximation with a Low-Rank Extension

Jan 1, 2025ยท
Wenbo Gong
Wenbo Gong
,
M Scetbon
,
C Ma
,
E Meeds
ยท 0 min read
Type
Publication
arXiv:2502.07752
Wenbo Gong
Authors
Senior Researcher
Senior Researcher at Microsoft Research Cambridge working on learning dynamics and optimization for foundation models, with prior work on causality and approximate inference.