Projects | Zongqing's Homepage

Peking University Reinforcement Learning Lab

The lab focuses on reinforcement learning and building generalist agents. Many thanks to our sponsors including NSFC, Huawei, Tencent, Alibaba, ByteDance, Hikvision, and inspir.ai.

Students

PhD Students

Haobin Jiang [Fall, 2020]
Yicheng Feng [Fall, 2021]
Haoqi Yuan [Fall, 2021]
Xiaopeng Yu [Fall, 2022]
Wanpeng Zhang [Fall, 2022]
Hao Luo [Fall, 2022]
Jiazheng Liu [Fall, 2023]
Junpeng Yue [Fall, 2023]
Chi Zhang [Fall, 2024]
Yuxuan Wang [Fall, 2024]
Feiyang Xie [Fall, 2025]
Penglin Cai [Fall, 2025]

Master Students

Bohan Zhou [Fall, 2023]
Yuhui Fu [Fall, 2024]

RAs/Interns

Alumni

Jiangxing Wang (PhD, 2025), BeingBeyond
Kefan Su (PhD, 2025), ByteDance
Jiafei Lyu (THU PhD, co-advised, 2025), Tencent
Jiechuan Jiang (PhD, 2024), Wizard Quant
Zhongdi Songshan (Master, 2024), Alibaba

Current Projects

Embodied Intelligence

Embodied intelligence focuses on creating robots that can perceive, reason, and act in the physical world. It brings together dexterous hand manipulation, humanoid whole-body control, and foundation models that link perception, language, and action. By combining these elements, embodied AI aims to enable robots that can generalize and perform complex real-world tasks with human-like adaptability. Dexhand Manipulation Dexterous hand manipulation focuses on enabling robots to interact with objects with the precision, adaptability, and coordination of human hands. [Read More…]

Embodied Intelligence, Reinforcement Learning, Foundation Models

Past Projects

Generalist Agents

Recently developed foundation models, such as large language models and multi-modal models, open great opportunities to build generally capable agents, combined with reinforcement learning. This project focuses on learning skills and foundation models and connecting them to build generalist agents. In the following, we introduce some of our studies. For more details, please refer to the papers. Plan4MC We study building a multi-task agent in Minecraft. Without human demonstrations, solving long-horizon tasks in this open-ended environment with reinforcement learning (RL) is extremely sample inefficient. [Read More…]

Reinforcement Learning, Foundation Models

RL/Multi-Agent RL

Multi-Agent Reinforcement Learning (MARL) has recently attracted much attention from the communities of machine learning, artificial intelligence, and multi-agent systems. As an interdisciplinary research field, there are so many unsolved problems, from cooperation to competition, from agent communication to agent modeling, from centralized learning to decentralized learning. MARL has been the main research focus of our lab. We are investigating the field from many perspectives. In the following, we introduce some of our studies. [Read More…]

Reinforcement Learning, Multiagent Learning

RL/MARL Applications

Reinforcement learning (RL) has the potential be applied to many real-world applications. In our research, we also investigate the applications of RL and Multi-agent RL. Currently, we have been investigating two applications: one is traffic signal control; another is EDA. Traffic signals coordinating traffic movements are the key for transportation efficiency. However, conventional traffic signal control that heavily relies on pre-defined rules and assumptions on traffic conditions is far from intelligence. [Read More…]

Reinforcement Learning, Applications

Distributed Video Processing Using Deep Learning on Networked Devices

The vast adoption of mobile devices with cameras has greatly assisted in the proliferation of the creation and distribution of videos. Videos, which are a rich source of information, can be exploited for on-demand information retrieval. Deep learning using Convolutional Neural Networks (CNNs) is state of the art computer vision techniques that can be used for information retrieval. However, due to the high computation of video processing using CNNs, it is not feasible or costs too much to process all videos at a centralized entity, considering a large set of videos which is common in this big data epoch. [Read More…]

Deep Learning, Edge Computing

Building Smartphone Networks

Smartphones have great networking capabilities. They can access the Internet through cellular networks or wireless access points and communicate with nearby devices using WiFi Direct or Bluetooth. However, these network functions may not work in some circumstances where cellular towers and network infrastructure are destroyed, e.g. in disaster recovery. Nevertheless, communications in such scenarios are very important, and hence, in this research, we aim to build smartphone networks to provide communications without relying on cellular networks, wireless access points, or network infrastructure. [Read More…]

Smartphones, Opportunistic Networking, Data Offload

Health Sensing Using Mobile Devices

Mobile devices, such as smartphones, have become commonplace in health care settings, leading to the development of both platforms and applications for health care, e.g., HealthKit on iOS, where apps can collect users’ health and activity data and the data will be used for medical research to bring more powerful health solutions. However, no data is collected for the research of infectious diseases. Moreover, currently, most health data are collected by manual input or external devices. [Read More…]

Infectious Diseases, Human Contact Networks, Respiratory Symptoms, Smartphones