残差连接的数学视角(一):mHC
从李群的视角看 ROPE 旋转位置编码
Part II of Symplectic Geometry - Properties
Part I of Symplectic Geometry - The Basis
Functional Analysis - Quick Explanation
Mousse - Rectifying the Geometry of Muon with Curvature-Aware Preconditioning
Nesterov
Functional Analysis 1 - Dual Space Solution of Optimal Control
Hessian 谱的 "Bulk + Spikes" 结构
Scaling Law