Two papers on RL, COREP and PAR, were accepted at ICML’24, and one paper about RL with a natural language action space, MIPO, was accepted at ACL’24.

Two papers about fine-tuning LLMs for decision-making tasks, AdaRefiner and LLaMA-Rider, were accepted at NAACL’24.

Three papers, on LMMs, policy pre-training, and offline RL respectively, were accepted at ICLR’24. Additionally, we have one paper accepted at TMLR, two at AAAI’24, and two at AAMAS’24. Congratulations to all.

Recent Publications

More Publications

Annual Meeting of the Association for Computational Linguistics, August 11-16, 2024.

Forty-first International Conference on Machine Learning (ICML), July 21-27, 2024 (Acceptance Rate: 27.5% = 2609/9473)

Forty-first International Conference on Machine Learning (ICML), July 21-27, 2024 (Acceptance Rate: 27.5% = 2609/9473)

Recently developed foundation models, such as large language models and multi-modal models, open great opportunities to build generally capable agents when combined with reinforcement learning. This project focuses on learning skills and foundation models and connecting them to build generalist agents. In the following, we introduce some of our studies. For more details, please refer to the papers.

Plan4MC

We study building a multi-task agent in Minecraft. Without human demonstrations, solving long-horizon tasks in this open-ended environment with reinforcement learning (RL) is extremely sample inefficient. [Read More…]

Multi-Agent Reinforcement Learning (MARL) has recently attracted much attention from the machine learning, artificial intelligence, and multi-agent systems communities. As an interdisciplinary research field, MARL has many unsolved problems, from cooperation to competition, from agent communication to agent modeling, and from centralized learning to decentralized learning. MARL has been the main research focus of our lab, and we are investigating the field from many perspectives. In the following, we introduce some of our studies. [Read More…]


Undergraduate Courses

  • Foundation Models and Agents, Fall 2024
  • Algorithms, Spring 2019, 2020, 2021, 2022
  • Data Structures and Algorithms, Spring 2018
  • Introduction to Computer Systems, Fall 2017

Graduate Courses

  • Deep Reinforcement Learning, Spring 2020, 2021, 2022, 2023, 2024


Area Chair and Editorship

  • Area Chair, ICLR 2024, ICML 2024, NeurIPS 2024

  • Associate Editor, ACM Journal on Autonomous Transportation Systems, 2022 -

  • Guest Editor, Machine Learning Special Issue on RL for Real Life, 2023

Conference Organization

  • IEEE Conference on Games 2022, Keynote Co-Chair

  • ICML 2021 Workshop on Reinforcement Learning for Real Life, General Co-Chair

  • INFOCOM 2020 Workshop on Network Intelligence, General Co-Chair

  • ACM TURC 2018, Award Co-Chair

Program Committee Member (Reviewer)

  • ICLR 2023 2022 2021 2020, NeurIPS 2023 2022 2021 2020, ICML 2023 2022 2021, AAAI 2022, IJCAI 2022 2021 2020, AAMAS 2020, CoRL 2020
  • Nature Machine Intelligence


  • Room 523, Yanyuan Building, Peking University, Beijing, 100871, China.

  • Office hours: please email me to schedule.