GaLore
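Finally found some time to write something
Combinatorics (I): Inclusion-Exclusion Principle, Binomial Inversion, and Stirling Numbers of the Second Kind
Why am I still reviewing first-year material? Feeling down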
Gamma Function
Extending the factorial to the real and complex numbers
Part IV of Mathematical Structure of Mamba - Mamba & Mamba2
The "Tri" in Tri Dao is the "Tri" in Triton
Triton Tutorial
What year is it? Still using plain old PyTorch?
Part III of Mathematical Structure of Mamba - S4D
(NIPS 22) S4D - On the Parameterization and Initialization of Diagonal State Space Models
Sparsemax
Studying convex optimization wasn't a waste after all
Part II of Mathematical Structure of Mamba - S4
(ICLR 22) S4 - Efficiently Modeling Long Sequences with Structured State Spaces
Part I of Mathematical Structure of Mamba - Hippo
(NIPS 20) HiPPO - Recurrent Memory with Optimal Polynomial Projections
Policy Gradient
Informal notes on the policy gradient algorithm