Monarch: Expressive structured matrices for efficient and accurate training. T Dao, B Chen, NS Sohoni, A Desai, M Poli, J Grogan, A Liu, A Rao, ... International Conference on Machine Learning, 4690-4721, 2022 | 55 | 2022 |
Monarch Mixer: A simple sub-quadratic GEMM-based architecture. D Fu, S Arora, J Grogan, I Johnson, ES Eyuboglu, A Thomas, B Spector, ... Advances in Neural Information Processing Systems 36, 2024 | 11 | 2024 |