CLadder: Assessing causal reasoning in language models Z Jin, Y Chen, F Leeb, L Gresele, O Kamal, Z Lyu, K Blin, ... Thirty-seventh Conference on Neural Information Processing Systems, 2023 | 11 | 2023 |
CLadder: A benchmark to assess causal reasoning capabilities of language models Z Jin, Y Chen, F Leeb, L Gresele, O Kamal, Z Lyu, K Blin, ... Advances in Neural Information Processing Systems 36, 2024 | 8 | 2024 |