Follow
Markus Anderljung
Markus Anderljung
Centre for the Governance of AI
Verified email at governance.ai - Homepage
Title
Cited by
Cited by
Year
Toward trustworthy AI development: mechanisms for supporting verifiable claims
M Brundage, S Avin, J Wang, H Belfield, G Krueger, G Hadfield, H Khlaaf, ...
arXiv preprint arXiv:2004.07213, 2020
3512020
Model evaluation for extreme risks
T Shevlane, S Farquhar, B Garfinkel, M Phuong, J Whittlestone, J Leung, ...
arXiv preprint arXiv:2305.15324, 2023
77*2023
Ethics and governance of artificial intelligence: Evidence from a survey of machine learning researchers
B Zhang, M Anderljung, L Kahn, N Dreksler, MC Horowitz, A Dafoe
Journal of Artificial Intelligence Research 71, 591–666-591–666, 2021
592021
Institutionalizing ethics in AI through broader impact requirements
CEA Prunkl, C Ashurst, M Anderljung, H Webb, J Leike, A Dafoe
Nature Machine Intelligence 3 (2), 104-110, 2021
532021
Frontier AI regulation: Managing emerging risks to public safety
M Anderljung, J Barnhart, J Leung, A Korinek, C O'Keefe, J Whittlestone, ...
arXiv preprint arXiv:2307.03718, 2023
51*2023
The Brussels effect and artificial intelligence: How EU regulation will impact the global AI market
C Siegmann, M Anderljung
arXiv preprint arXiv:2208.12645, 2022
402022
Towards best practices in AGI safety and governance: A survey of expert opinion
J Schuett, N Dreksler, M Anderljung, D McCaffary, L Heim, E Bluemke, ...
arXiv preprint arXiv:2305.07153, 2023
322023
Filling gaps in trustworthy development of AI
NZ Shahar Avin, Haydn Belfield, Miles Brundage, Gretchen Krueger, Jasmine ...
Science 374 (6573), pp. 1327-1329, 2021
302021
Forecasting AI progress: Evidence from a survey of machine learning researchers
B Zhang, N Dreksler, M Anderljung, L Kahn, C Giattino, A Dafoe, ...
arXiv preprint arXiv:2206.04132, 2022
222022
Protecting Society from AI Misuse: When are Restrictions on Capabilities Warranted?
M Anderljung, J Hazell
arXiv preprint arXiv:2303.09377, 2023
202023
Social and governance implications of improved data efficiency
AD Tucker, M Anderljung, A Dafoe
Proceedings of the AAAI/ACM Conference on AI, Ethics, and Society, 378-384, 2020
132020
Open-sourcing highly capable foundation models: An evaluation of risks, benefits, and alternative methods for pursuing open-source objectives
E Seger, N Dreksler, R Moulange, E Dardaman, J Schuett, K Wei, ...
arXiv preprint arXiv:2311.09227, 2023
92023
AI policy levers: A review of the US Government’s tools to shape AI research, development, and deployment
SC Fischer, J Leung, M Anderljung, C O’keefe, S Torges, SM Khan, ...
Retrieved June 1, 2022, 2021
82021
A guide to writing the NeurIPS impact statement
C Ashurst, M Anderljung, C Prunkl, J Leike, Y Gal, T Shevlane, A Dafoe
Centre for the Governance of AI. URL: https://perma. cc/B5R8-2B9V, 2020
82020
Skilled and Mobile: Survey Evidence of AI Researchers' Immigration Preferences
R Zwetsloot, B Zhang, N Dreksler, L Kahn, M Anderljung, A Dafoe, ...
Proceedings of the 2021 AAAI/ACM Conference on AI, Ethics, and Society, 1050 …, 2021
72021
Towards publicly accountable frontier llms
M Anderljung, E Smith, J O'Brien, L Soder, B Bucknall, E Bluemke, ...
Socially Responsible Language Modelling Research, 2023
3*2023
Ethics and governance of artificial intelligence: a survey of machine learning researchers
B Zhang, M Anderljung, L Kahn, N Dreksler, MC Horowitz, A Dafoe
IJCAI, 5787-5791, 2022
32022
The Immigration Preferences of Top AI Researchers: New Survey Evidence
R Zwetsloot, B Zhang, M Anderljung, MC Horowitz, A Dafoe
Perry World House and The Future of Humanity Institute, 2021
32021
Computing Power and the Governance of Artificial Intelligence
G Sastry, L Heim, H Belfield, M Anderljung, M Brundage, J Hazell, ...
arXiv preprint arXiv:2402.08797, 2024
22024
Compute Funds and Pre-Trained Models: Govai Blog
M Anderljung, L Heim, T Shevlane
RSS. Accessed May 13, 2023
22023
The system can't perform the operation now. Try again later.
Articles 1–20