OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis Qiushi Sun*, Kanzhi Cheng*, Zichen Ding*, Chuanyang Jin*, Yian Wang, Fangzhi Xu, Zhenyu Wu, Chengyou Jia, Liheng Chen, Zhoumianze Liu, Ben Kao, Guohao Li, Junxian He, Yu Qiao, Zhiyong Wu ACL 2025 (Huggingface Daily Paper Top-1) paper / project / code / model / data / slides
C3
MuMA-ToM: Multi-modal Multi-Agent Theory of Mind Haojun Shi*, Suyu Ye*, Xinyu Fang, Chuanyang Jin, Leyla Isik, Yen-Ling Kuo, Tianmin Shu AAAI 2025 (Oral) paper / project / code / benchmark / leaderboard / slides
C2
MMToM-QA: Multimodal Theory of Mind Question Answering Chuanyang Jin, Yutong Wu, Jing Cao, Jiannan Xiang, Yen-Ling Kuo, Zhiting Hu, Tomer Ullman, Antonio Torralba, Joshua Tenenbaum, Tianmin Shu ACL 2024 (Outstanding Paper Award) paper / project / code / benchmark / leaderboard / slides / Futurity news / JHU news
J2
How Far Are We From AGI? Tao Feng*, Chuanyang Jin*, Jingyu Liu*, Kunlun Zhu*, Haoqin Tu, Zirui Cheng, Guanyu Lin, Jiaxuan You TMLR 2024 / ICLR 2024 AGI Workshop (Oral) paper / project
C1
Neural Amortized Inference for Nested Multi-agent Reasoning Kunal Jha, Tuan Anh Le, Chuanyang Jin, Yen-Ling Kuo, Joshua Tenenbaum, Tianmin Shu AAAI 2024 / AAAI 2024 Summer Symposium (Oral) paper / project / code / slides
J1
Dynamics of RNA Localization to Nuclear Speckles are Connected to Splicing Efficiency Jinjun Wu*, Yu Xiao*, Yunzheng Liu*, Li Wen,Chuanyang Jin, Shun Liu, Sneha Paul, Chuan He, Oded Regev, Jingyi Fei Science Advances 10 (42), eadp7727 paper
Preprints and Under Review
P1
AutoToM: Scaling Model-based Mental Inference via Automated Agent Modeling Zhining Zhang*, Chuanyang Jin*†, Mung Yao Jia*, Tianmin Shu Preprint paper / project / code
Workshop and Technical Reports
W3
Humanity’s Last Exam Center for AI Safety & Scale AI Technical Report paper
W2
Beyond the Binary: Capturing Diverse Preferences With Reward Regularization Vishakh Padmakumar*,Chuanyang Jin*, Hannah Rose Kirk*, He He NeurIPS 2024 Workshop on Socially Responsible Language Modelling Research paper / code
W1
The Cultural Psychology of Large Language Models Chuanyang Jin*, Songyang Zhang*, Tianmin Shu, Zhihan Cui Technical Report paper
Open-Source Projects
O2
OpenCompass: A Universal Evaluation Platform for Foundation Models OpenCompass Team GitHub repository (Over 5k stars)
O1
Fast-DiT: Fast Diffusion Models with Transformers Chuanyang Jin, Saining Xie GitHub repository (Over 800 stars)