AutoToM: Scaling Model-based Mental Inference via Automated Agent Modeling Zhining Zhang*, Chuanyang Jin*†, Mung Yao Jia*, Shunchi Zhang*, Tianmin Shu (†: project lead) NeurIPS 2025 (Spotlight) paper / project / code
C6
SoMi-ToM: Evaluating Multi-Perspective Theory of Mind in Embodied Social Interactions Xianzhe Fan, Xuhui Zhou, Chuanyang Jin, Kolby Nottingham, Hao Zhu, Maarten Sap NeurIPS D&B 2025 paper / code / benchmark
C5
Do VLMs have internal World Models? Towards an Atomic Evaluation Qiyue Gao*, Xinyu Pi*, Kevin Liu, Junrong Chen, Ruolan Yang, Xinqi Huang, Xinyu Fang, Lu Sun, Gautham Kishore, Bo Ai, Stone Tao, Mengyang Liu, Jiaxi Yang, Chao-Jung Lai, Chuanyang Jin, Jiannan Xiang, Benhao Huang, Zeming Chen, David Danks, Hao Su, Tianmin Shu, Ziqiao Ma, Lianhui Qin, Zhiting Hu ACL 2025 Findings paper / project / benchmark
C4
OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis Qiushi Sun*, Kanzhi Cheng*, Zichen Ding*, Chuanyang Jin*, Yian Wang, Fangzhi Xu, Zhenyu Wu, Chengyou Jia, Liheng Chen, Zhoumianze Liu, Ben Kao, Guohao Li, Junxian He, Yu Qiao, Zhiyong Wu ACL 2025 (Huggingface Daily Paper Top-1) paper / project / code / model / data / slides
C3
MuMA-ToM: Multi-modal Multi-Agent Theory of Mind Haojun Shi*, Suyu Ye*, Xinyu Fang, Chuanyang Jin, Leyla Isik, Yen-Ling Kuo, Tianmin Shu AAAI 2025 (Oral) paper / project / code / benchmark / leaderboard / slides
C2
MMToM-QA: Multimodal Theory of Mind Question Answering Chuanyang Jin, Yutong Wu, Jing Cao, Jiannan Xiang, Yen-Ling Kuo, Zhiting Hu, Tomer Ullman, Antonio Torralba, Joshua Tenenbaum, Tianmin Shu ACL 2024 (Outstanding Paper Award) paper / project / code / benchmark / leaderboard / slides / Futurity news / JHU news
J2
How Far Are We From AGI? Tao Feng*, Chuanyang Jin*, Jingyu Liu*, Kunlun Zhu*, Haoqin Tu, Zirui Cheng, Guanyu Lin, Jiaxuan You TMLR 2024 / ICLR 2024 AGI Workshop (Oral) paper / project
C1
Neural Amortized Inference for Nested Multi-agent Reasoning Kunal Jha, Tuan Anh Le, Chuanyang Jin, Yen-Ling Kuo, Joshua Tenenbaum, Tianmin Shu AAAI 2024 / AAAI 2024 Summer Symposium (Oral) paper / project / code / slides
J1
Dynamics of RNA Localization to Nuclear Speckles are Connected to Splicing Efficiency Jinjun Wu*, Yu Xiao*, Yunzheng Liu*, Li Wen,Chuanyang Jin, Shun Liu, Sneha Paul, Chuan He, Oded Regev, Jingyi Fei Science Advances 10 (42), eadp7727 paper
Preprints and Under Review
Workshop and Technical Reports
W3
Humanity’s Last Exam Center for AI Safety & Scale AI Technical Report paper
W2
Beyond the Binary: Capturing Diverse Preferences With Reward Regularization Vishakh Padmakumar*,Chuanyang Jin*, Hannah Rose Kirk*, He He NeurIPS 2024 Workshop on Socially Responsible Language Modelling Research paper / code
W1
The Cultural Psychology of Large Language Models Chuanyang Jin*, Songyang Zhang*, Tianmin Shu, Zhihan Cui Technical Report paper
Open-Source Projects
O2
OpenCompass: A Universal Evaluation Platform for Foundation Models OpenCompass Team GitHub repository (Over 5k stars)
O1
Fast-DiT: Fast Diffusion Models with Transformers Chuanyang Jin, Saining Xie GitHub repository (Over 800 stars)