2024 Pivotal Fellows
-
Kristina Fort
The AI Safety Institutes and International AI Coordination
-
Jonathan Gibson
Which confidence-building measures are most politically viable for US-China Track II AI dialogues?
-
Diogo Cruz
Understanding the learned look-ahead behavior of chess neural networks
-
Meenakshi Balan
Modeling pre-superhuman persuasion risks from agentic systems
-
Aidan Homewood
Third-party auditing of frontier models: case studies from non-AI industries
-
Felix Porée
Understanding apocalyptic bioterrorism and its effects on rational deterrence theories
-
Monika Jotautaite
LLM moral value evaluations
-
Lennart Finke
Guarding Democracies with Evaluations of Political Persuasion
-
Abel Ashby
Developing frameworks for red teaming DNA synthesis screening
-
Jakub Krys
Transferability of adversarial attacks between modalities of vision-language models
-
Jeanne Salle
Predicting models capabilities through observational scaling laws
-
Joss Oliver
The Multi-agent Dynamics of Elliott Thornley's TD-agents
-
Luise Woehlke
Will the US Government Control the First AGI?—Lessons From History
-
Michel Justen
Sharing the AI Windfall: A Strategic Approach to International Benefit-Sharing
-
Natasha Karner
Runaway Military AI: Governance Initiatives for AI in the Military Domain