Research Focus
Designing LLM-based autonomous agents that collaborate effectively to tackle complex tasks (software development, self-evolution, and latent-space reasoning).
I am a second-year master's student in Computer Science at Zhejiang University, working with Prof. Wei Chen at ZJUVAI. I am also an LLM Algorithm Intern at ByteDance, following internships at Alibaba and Tsinghua University.
Prospective PhD programs: Seeking Ph.D. positions for Fall 2027. Open to collaborations. Prospective interns: Seeking Research Interns for LLMs & Agents. Email me.
News
Apr 2026
Three papers have been accepted to ACL 2026! Looking forward to see you in San Diego this July.
Selected Publications
(* denotes equal contribution)
Communication in Latent Space
Enabling Agents to Communicate Entirely in Latent Space.
A Self-Improving Framework for Multi-Agent Systems via Textual Parameter Graph Optimization.
Shan He, Runze Wang, Zhuoyun Du, Huiyu Bai, Zouying Cao, Yu Cheng, Bo Zheng. ACL 2026.
LLM Algorithm Intern (Jindouyun Talent Program) · Mar 2026 – Present
Focusing on LLM post-training (CT, SFT, RLHF/DPO) and constrained reinforcement learning in complex interactive environments. Researching preference alignment and multi-agent frameworks to tackle hallucination control, reward hacking, and policy compliance in high-stakes game-theoretic scenarios.
Alibaba Group
Research Intern @ Taotian Future Living Lab · Jan 2025 – Mar 2026
Spearheaded research on novel multi-agent collaborative paradigms, emphasizing latent space reasoning and communication efficiency. Designed and optimized Supervised Fine-Tuning (SFT) pipelines to substantially enhance LLM reasoning capabilities and task-solving performance in complex scenarios.
Tsinghua University
Research Assistant @ THUNLP · Nov 2023 – Aug 2024
Contributed to the ChatDev project and related studies with 32k stars on LLM-based multi-agent systems (especially Croto & MacNet), focusing on cross-agent and cross-team collaboration.
Featured Project
ChatDev & Croto
Pioneering LLM-based multi-agent collaboration. Actively contributed to the development and served as the leader for the Croto branch.
32,000+ Stars on GitHub. Featured by Andrew Ng & DeepMind.
A comprehensive collection of research papers on LLM-based multi-agent systems presented in an interactive e-book format. Organizes cutting-edge research into task-solving-oriented and social-simulation-oriented systems.
Beyond research, I enjoy soccer, fencing, snowboarding, billiards, playing the piano, photography, and physics.
Visit my personal blog. Last updated: March 2026.