AutoToM: Scaling Model-based Mental Inference via Automated Agent Modeling Zhining Zhang*, Chuanyang Jin*†, Mung Yao Jia*, Shunchi Zhang*, Tianmin Shu (†: project lead) NeurIPS 2025 (Spotlight) paper / project / code
C6
SoMi-ToM: Evaluating Multi-Perspective Theory of Mind in Embodied Social Interactions Xianzhe Fan, Xuhui Zhou, Chuanyang Jin, Kolby Nottingham, Hao Zhu, Maarten Sap NeurIPS D&B 2025 paper / code / benchmark
C5
Do VLMs have internal World Models? Towards an Atomic Evaluation Qiyue Gao*, Xinyu Pi*, Kevin Liu, Junrong Chen, Ruolan Yang, Xinqi Huang, Xinyu Fang, Lu Sun, Gautham Kishore, Bo Ai, Stone Tao, Mengyang Liu, Jiaxi Yang, Chao-Jung Lai, Chuanyang Jin, Jiannan Xiang, Benhao Huang, Zeming Chen, David Danks, Hao Su, Tianmin Shu, Ziqiao Ma, Lianhui Qin, Zhiting Hu ACL 2025 Findings (Huggingface Daily Papers Top-3) paper / project / benchmark
C4
OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis Qiushi Sun*, Kanzhi Cheng*, Zichen Ding*, Chuanyang Jin*, Yian Wang, Fangzhi Xu, Zhenyu Wu, Chengyou Jia, Liheng Chen, Zhoumianze Liu, Ben Kao, Guohao Li, Junxian He, Yu Qiao, Zhiyong Wu ACL 2025 (Huggingface Daily Papers Top-1) paper / project / code / model / data / slides
C3
MuMA-ToM: Multi-modal Multi-Agent Theory of Mind Haojun Shi*, Suyu Ye*, Xinyu Fang, Chuanyang Jin, Leyla Isik, Yen-Ling Kuo, Tianmin Shu AAAI 2025 (Oral) paper / project / code / benchmark / leaderboard / slides
C2
MMToM-QA: Multimodal Theory of Mind Question Answering Chuanyang Jin, Yutong Wu, Jing Cao, Jiannan Xiang, Yen-Ling Kuo, Zhiting Hu, Tomer Ullman, Antonio Torralba, Joshua Tenenbaum, Tianmin Shu ACL 2024 (Outstanding Paper Award) paper / project / code / benchmark / leaderboard / slides / Futurity news / JHU news
J2
How Far Are We From AGI? Tao Feng*, Chuanyang Jin*, Jingyu Liu*, Kunlun Zhu*, Haoqin Tu, Zirui Cheng, Guanyu Lin, Jiaxuan You TMLR 2024 / ICLR 2024 AGI Workshop (Oral) paper / project
C1
Neural Amortized Inference for Nested Multi-agent Reasoning Kunal Jha, Tuan Anh Le, Chuanyang Jin, Yen-Ling Kuo, Joshua Tenenbaum, Tianmin Shu AAAI 2024 / AAAI 2024 Summer Symposium (Oral) paper / project / code / slides
J1
Dynamics of RNA Localization to Nuclear Speckles are Connected to Splicing Efficiency Jinjun Wu*, Yu Xiao*, Yunzheng Liu*, Li Wen,Chuanyang Jin, Shun Liu, Sneha Paul, Chuan He, Oded Regev, Jingyi Fei Science Advances 10 (42), eadp7727 paper
Preprints and Under Review
P1
The Era of Real-World Human Interaction: RL from User Conversations Chuanyang Jin, Jing Xu, Bo Liu, Leitian Tao, Olga Golovneva, Tianmin Shu, Wenting Zhao, Xian Li, Jason Weston arXiv Preprint (Invited Talk at Google; Paper of the Week by Huggingface/DAIR.AI/TuringPost) paper
P2
SPICE: Self-Play In Corpus Environments Improves Reasoning Bo Liu, Chuanyang Jin, Seungone Kim, Weizhe Yuan, Wenting Zhao, Ilia Kulikov, Xian Li, Sainbayar Sukhbaatar, Jack Lanchantin, Jason Weston arXiv Preprint paper
Workshop and Technical Reports
W3
Humanity’s Last Exam Center for AI Safety & Scale AI Technical Report (featured in The New York Times and Reuters) paper
W2
Beyond the Binary: Capturing Diverse Preferences With Reward Regularization Vishakh Padmakumar*,Chuanyang Jin*, Hannah Rose Kirk*, He He NeurIPS 2024 Workshop on Socially Responsible Language Modelling Research paper / code
W1
The Cultural Psychology of Large Language Models Chuanyang Jin*, Songyang Zhang*, Tianmin Shu, Zhihan Cui Technical Report paper
Open-Source Projects
O2
OpenCompass: A Universal Evaluation Platform for Foundation Models OpenCompass Team GitHub repository (Over 6k stars)
O1
Fast-DiT: Fast Diffusion Models with Transformers Chuanyang Jin, Saining Xie GitHub repository (Over 800 stars)