[Submitted on 1 Nov 2025]
AMO: Analysis of Adaptive Momentum Optimization
View PDFAbstract:We analyze Adaptive Momentum Optimization (AMO) for language models. While showing stable training, AMO underperformed standard baselines (validation loss 9.773 vs AdamW's 4.9266). Our negative results contribute insights into momentum adaptation challenges.
Submission history
[v1] Sat, 1 Nov 2025 06:45 UTC