Preprints
Neha Balamurugan, Sarah A. Wu, Adam Chun, Gabe Gaw, Cristobal Eyzaguirre, and Tobias Gerstenberg
(submitted).
Spot The Ball: A Benchmark for Visual Social Inference.
arXiv:2511.00261 [cs.CV].
Publications
Carlota Parés-Morlans, Michelle Yi, Claire Chen, Sarah A. Wu, Rika Antonova, Tobias Gerstenberg, and Jeannette Bohg
(2025).
Causal-PIK: Causality-based physical reasoning with a physics-informed kernel.
International Conference on Machine Learning (ICML).
Emily Jin, Zhuoyi Huang, Jan-Philipp Fränken, Weiyu Liu, Hannah Cha, Erik Brockbank, Sarah A. Wu, Ruohan Zhang, Jiajun Wu, and Tobias Gerstenberg
(2024).
MARPLE: A Benchmark for Long-Horizon Inference.
Advances in Neural Information Processing Systems (NeurIPS).
Sarah A. Wu and Tobias Gerstenberg
(2023).
If not me, then who? Responsibility and replacement.
Cognition, 242, 105646.
Rose E. Wang*, Sarah A. Wu*, James A. Evans, Joshua B. Tenenbaum, David C. Parkes, and Max Kleiman-Weiner
(2021).
Too many cooks: Bayesian inference for coordinating multi-agent collaboration.
In S. Muggleton and N. Chater (Eds.),
Human-like Machine Intelligence, 152-170.
Oxford University Press.
Sarah A. Wu*, Rose E. Wang*, James A. Evans, Joshua B. Tenenbaum, David C. Parkes, and Max Kleiman-Weiner
(2021).
Too many cooks: Bayesian inference for coordinating multi-agent collaboration.
Topics in Cognitive Science, 13(2), 414-432.
★ CogSci 2020 – Computational Modeling Prize in Higher-Level Cognition
★ NeurIPS 2020 CoopAI Workshop – Best Paper Award
Sarah A. Wu and Edward Gibson
(2021).
Word order predicts cross-linguistic differences in the production of redundant color and number modifiers.
Cognitive Science, 45(1), e12934.
Other Publications
Verona Teo, Sarah A. Wu, Erik Brockbank, and Tobias Gerstenberg
(2025).
Leave a trace: Recursive reasoning about deceptive behavior.
Proceedings of the 47th Annual Conference of the Cognitive Science Society.
Sarah A. Wu*, Erik Brockbank*, Hannah Cha, Jan-Philipp Fränken, Emily Jin, Zhuoyi Huang, Weiyu Liu, Ruohan Zhang, Jiajun Wu, and Tobias Gerstenberg
(2024).
Whodunnit? Inferring what happened from multimodal evidence.
Proceedings of the 46th Annual Conference of the Cognitive Science Society.
Sarah A. Wu, Xiang Ren, and Sydney Levine
(2024).
Resource-rational moral judgment.
Proceedings of the 46th Annual Conference of the Cognitive Science Society.
Sarah A. Wu, Shruti Sridhar, and Tobias Gerstenberg
(2023).
A computational model of responsibility judgments from counterfactual simulations and intention inferences.
Proceedings of the 45th Annual Conference of the Cognitive Science Society.