Publications

(* denotes equal contribution; † denotes project lead)

Peer-Reviewed Papers

C7
AutoToM: Scaling Model-based Mental Inference via Automated Agent Modeling
Zhining Zhang*, Chuanyang Jin*†, Mung Yao Jia*, Shunchi Zhang*, Tianmin Shu
NeurIPS 2025 (Spotlight)
paper / project / code
C6
SoMi-ToM: Evaluating Multi-Perspective Theory of Mind in Embodied Social Interactions
Xianzhe Fan, Xuhui Zhou, Chuanyang Jin, Kolby Nottingham, Hao Zhu, Maarten Sap
NeurIPS D&B 2025
paper / code / benchmark
C5
Do VLMs have internal World Models? Towards an Atomic Evaluation
Qiyue Gao*, Xinyu Pi*, Kevin Liu, Junrong Chen, Ruolan Yang, Xinqi Huang, Xinyu Fang, Lu Sun, Gautham Kishore, Bo Ai, Stone Tao, Mengyang Liu, Jiaxi Yang, Chao-Jung Lai, Chuanyang Jin, Jiannan Xiang, Benhao Huang, Zeming Chen, David Danks, Hao Su, Tianmin Shu, Ziqiao Ma, Lianhui Qin, Zhiting Hu
ACL 2025 Findings
paper / project / benchmark
C4
OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis
Qiushi Sun*, Kanzhi Cheng*, Zichen Ding*, Chuanyang Jin*, Yian Wang, Fangzhi Xu, Zhenyu Wu, Chengyou Jia, Liheng Chen, Zhoumianze Liu, Ben Kao, Guohao Li, Junxian He, Yu Qiao, Zhiyong Wu
ACL 2025 (Hugging Face Daily Paper Top-1)
paper / project / code / model / data / slides
C3
MuMA-ToM: Multi-modal Multi-Agent Theory of Mind
Haojun Shi*, Suyu Ye*, Xinyu Fang, Chuanyang Jin, Leyla Isik, Yen-Ling Kuo, Tianmin Shu
AAAI 2025 (Oral)
paper / project / code / benchmark / leaderboard / slides
C2
MMToM-QA: Multimodal Theory of Mind Question Answering
Chuanyang Jin, Yutong Wu, Jing Cao, Jiannan Xiang, Yen-Ling Kuo, Zhiting Hu, Tomer Ullman, Antonio Torralba, Joshua Tenenbaum, Tianmin Shu
ACL 2024 (Outstanding Paper Award)
paper / project / code / benchmark / leaderboard / slides / Futurity news / JHU news
J2
How Far Are We From AGI?
Tao Feng*, Chuanyang Jin*, Jingyu Liu*, Kunlun Zhu*, Haoqin Tu, Zirui Cheng, Guanyu Lin, Jiaxuan You
TMLR 2024 / ICLR 2024 AGI Workshop (Oral)
paper / project
C1
Neural Amortized Inference for Nested Multi-agent Reasoning
Kunal Jha, Tuan Anh Le, Chuanyang Jin, Yen-Ling Kuo, Joshua Tenenbaum, Tianmin Shu
AAAI 2024 / AAAI 2024 Summer Symposium (Oral)
paper / project / code / slides
J1
Dynamics of RNA Localization to Nuclear Speckles are Connected to Splicing Efficiency
Jinjun Wu*, Yu Xiao*, Yunzheng Liu*, Li Wen, Chuanyang Jin, Shun Liu, Sneha Paul, Chuan He, Oded Regev, Jingyi Fei
Science Advances 10 (42), eadp7727
paper

Preprints and Under Review

Workshop and Technical Reports

W3
Humanity’s Last Exam
Center for AI Safety & Scale AI
Technical Report
paper
W2
Beyond the Binary: Capturing Diverse Preferences With Reward Regularization
Vishakh Padmakumar*, Chuanyang Jin*, Hannah Rose Kirk*, He He
NeurIPS 2024 Workshop on Socially Responsible Language Modelling Research
paper / code
W1
The Cultural Psychology of Large Language Models
Chuanyang Jin*, Songyang Zhang*, Tianmin Shu, Zhihan Cui
Technical Report
paper

Open-Source Projects

O2
OpenCompass: A Universal Evaluation Platform for Foundation Models
OpenCompass Team
GitHub repository (Over 5k stars)
O1
Fast-DiT: Fast Diffusion Models with Transformers
Chuanyang Jin, Saining Xie
GitHub repository (Over 800 stars)

© Chuanyang Jin, 2023