GPT-4 technical report J Achiam, S Adler, S Agarwal, L Ahmad, I Akkaya, FL Aleman, D Almeida, ... arXiv preprint arXiv:2303.08774, 2023 | 1510* | 2023 |
GPT-J-6B: A 6 Billion Parameter Autoregressive Language Model B Wang, A Komatsuzaki https://github.com/kingoflolz/mesh-transformer-jax#gpt-j-6b, 2021 | 649 | 2021 |
GPT-NeoX-20B: An Open-Source Autoregressive Language Model S Black, S Biderman, E Hallahan, Q Anthony, L Gao, L Golding, H He, ... arXiv preprint arXiv:2204.06745, 2022 | 536 | 2022 |
Mesh-Transformer-JAX: Model-Parallel Implementation of Transformer Language Model with JAX B Wang https://github.com/kingoflolz/mesh-transformer-jax 2, 2021 | 87 | 2021 |
A framework for few-shot language model evaluation L Gao, J Tow, S Biderman, S Black, A DiPofi, C Foster, L Golding, J Hsu, ... Version v0. 0.1. Sept, 8, 2021 | 70 | 2021 |
A framework for few-shot language model evaluation, 12 2023 L Gao, J Tow, B Abbasi, S Biderman, S Black, A DiPofi, C Foster, ... URL https://zenodo. org/records/10256836 7, 0 | 28 | |
A framework for few-shot language model evaluation, Sept. 2021 L Gao, J Tow, S Biderman, S Black, A DiPofi, C Foster, L Golding, J Hsu, ... URL https://doi. org/10 5281, 0 | 9 | |
LFTag: A scalable visual fiducial system with low spatial frequency B Wang 2020 2nd International Conference on Advances in Computer Technology …, 2020 | 8 | 2020 |