Zhuoyun Du

Zhuoyun Du 杜卓耘

M.Eng. Student, Data Science and Engineering, Zhejiang University

State Key Laboratory of CAD&CG, advised by Prof. Wei Chen

Data Dept Talent Program Intern, ByteDance

duzy@zju.edu.cn

I am a master's student in Data Science and Engineering at Zhejiang University (Sept. 2024 - Jul. 2027 expected), advised by Prof. Wei Chen at ZJUVAI within the State Key Laboratory of CAD&CG. Before joining Zhejiang University, I received my B.Eng. degree in Computer Science from Jinan University. My research focuses on LLM-based autonomous agents, especially multi-agent collaboration, autonomous evolution, latent-space communication and reasoning, autonomous evaluation, and reasoning-oriented post-training.

I am currently a Talent Program intern at ByteDance, working on LLM post-training, preference alignment, interactive evaluation, and high-fidelity human-like agent simulation for SFT and RLHF/DPO data generation. Previously, I worked as a research intern at Alibaba Group's Future Living Lab and as a research assistant at THUNLP, Tsinghua University, working with Prof. Zhiyuan Liu, where I contributed to the ChatDev project.

Prospective PhD applications: I am actively seeking Ph.D. positions starting in Fall 2027. I am especially interested in labs working on foundation models, LLM agents, multi-agent systems, autonomous evolution and evaluation, latent-space communication and reasoning, and post-training. Please feel free to reach out if my background may align with your group.

Prospective research internships and collaborations: I am looking for research internship or collaboration opportunities around LLM agents, multi-agent systems, and latent-space communication.

Research Interests

News

Selected Publications

* denotes equal contribution.

Zhuoyun Du*, Runze Wang*, Huiyu Bai, Zouying Cao, Xiaoyong Zhu, Bo Zheng, Wei Chen, and Haochao Ying
Association for Computational Linguistics (ACL), 2026, oral recommended
Zhuoyun Du*, Lujie Zheng*, Renjun Hu, Yuyang Xu, Xiawei Li, Ying Sun, Wei Chen, Jian Wu, Haolei Cai, and Haohao Ying
Association for Computational Linguistics (ACL), 2025
Zhuoyun Du*, Chen Qian*, Wei Liu, Zihao Xie, Yifei Wang, Yufan Dang, Weize Chen, and Cheng Yang
Findings of the Association for Computational Linguistics (ACL Findings), 2025
Learning to Evolve: A Self-Improving Framework for Multi-Agent Systems via Textual Parameter Graph Optimization
Shan He, Runze Wang, Zhuoyun Du, Huiyu Bai, Zouying Cao, Yucheng, and Bo Zheng
Findings of the Association for Computational Linguistics (ACL Findings), 2026
MemCoRL: Alternating Co-Optimization of Memory Retrieval and Utilization via Collaborative Reinforcement Learning
Yuewen Liu, Peng Xu, Zhuoyun Du, Muxi Diao, Anyi Zhang, Yang Li, and Yutong Zhang
Association for Computational Linguistics (ACL), 2026
Huiyu Bai*, Runze Wang*, Zhuoyun Du*, Yiyang Zhao, Fengji Zhang, Haoyu Chen, Xiaoyong Zhu, Bo Zheng, and Xuejiao Zhao
arXiv preprint, 2025
Chen Qian*, Zihao Xie*, Yifei Wang*, Wei Liu, Yufan Dang, Zhuoyun Du, Weize Chen, Cheng Yang, Zhiyuan Liu, and Maosong Sun
International Conference on Learning Representations (ICLR), 2025
Wei Liu*, Chenxi Wang*, Yifei Wang, Zihao Xie, Rennai Qiu, Yufan Dang, Zhuoyun Du, Weize Chen, Cheng Yang, and Chen Qian
Conference on Neural Information Processing Systems (NeurIPS), 2024
Yuyang Xu, Yi Cheng, Haochao Ying, Zhuoyun Du, Renjun Hu, Xing Shi, Wei Lin, and Jian Wu
arXiv preprint, 2025

Experience

Talent Program Intern, ByteDance (Mar. 2026 - present)
  • Built the team's first data-flywheel rollout system for multi-turn, task-oriented dialogue, supporting downstream multi-turn dialogue RL and agentic RL experiments.
  • Designed self-play simulation and evaluation environments with high-fidelity, persona-conditioned user-simulation agents.
  • Worked on LLM post-training, preference alignment, and autonomous agent-driven evaluation for dialogue policies.
Research Intern, Future Living Lab (now part of Token Foundry), Alibaba Group (Jan. 2025 - Mar. 2026)
  • Proposed Interlat (code), a latent-space communication framework that replaces natural-language message passing with temporally aligned hidden-state exchange.
  • Designed a token-latent curriculum and learned compression scheme, achieving up to 24x communication speedup at comparable task performance.
  • Gained hands-on experience with distributed training on 128+ GPUs; co-authored Online-PVLM, MemCoRL, and Learning to Evolve.
M.Eng. Research, Zhejiang University, State Key Lab of CAD&CG (Sept. 2024 - present)
  • Led EvoPatient (code), an open-source doctor-patient coevolution system for virtual standardized patients, accepted to ACL 2025.
  • Drove the project end-to-end, including research formulation, system implementation, medical-case curation, expert evaluation, and paper writing.
  • Improved patient-answer ability from 0.763 to 0.860 after 200 training cases.
Research Assistant, THUNLP, Tsinghua University (Dec. 2023 - Aug. 2024)
  • Worked with Prof. Zhiyuan Liu and Prof. Chen Qian on ChatDev-based multi-agent collaboration, studying role organization, information exchange, and coordination.
  • Led Croto (code), designing cross-team orchestration with parallel teams, hierarchy partitioning, and greedy aggregation for multi-agent software development.
  • Contributed to MACNet (code) and iAgent (code); related OpenBMB/ChatDev projects have accumulated over 33k GitHub stars.

Awards and Honors

Academic Service

Miscellaneous

Beyond research, I enjoy soccer, fencing, snowboarding, billiards, ballroom dancing, piano, photography, and physics. I occasionally write notes on my personal blog.

Visitors: loading Views: loading