Classification of long sequential data using circular dilated convolutional neural networks L Cheng, R Khalitov, T Yu, J Zhang, Z Yang Neurocomputing 518, 50-59, 2023 | 26 | 2023 |
ChordMixer: A scalable neural attention model for sequences with different lengths R Khalitov, T Yu, L Cheng, Z Yang arXiv preprint arXiv:2206.05852, 2022 | 15 | 2022 |
Sparse factorization of square matrices with application to neural attention modeling R Khalitov, T Yu, L Cheng, Z Yang Neural Networks 152, 160-168, 2022 | 9 | 2022 |
Paramixer: Parameterizing mixing links in sparse factors works better than dot-product self-attention T Yu, R Khalitov, L Cheng, Z Yang Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022 | 6 | 2022 |
Sparse Factorization of Large Square Matrices R Khalitov, T Yu, L Cheng, Z Yang arXiv preprint arXiv:2109.08184, 2021 | 4 | 2021 |
Self-supervised learning for DNA sequences with circular dilated convolutional networks L Cheng, T Yu, R Khalitov, Z Yang Neural Networks 171, 466-473, 2024 | 1 | 2024 |
Self-Distillation Improves DNA Sequence Inference T Yu, L Cheng, R Khalitov, EB Olsson, Z Yang arXiv preprint arXiv:2405.08538, 2024 | | 2024 |
A Sparse and Wide Neural Network Model for DNA Sequences T Yu, L Cheng, R Khalitov A Sparse and Wide Neural Network Model for DNA Sequences, 0 | | |