
RAD: Training an End-to-End Driving Policy via Large-Scale 3DGS-based Reinforcement Learning

Hao Gao1, Shaoyu Chen1,2,†, Bo Jiang1, Bencheng Liao1, Yiang Shi1, Xiaoyang Guo2, Yuechuan Pu2, Haoran Yin2, Xiangyu Li2, Xinbang Zhang2, Ying Zhang2, Wenyu Liu1, Qian Zhang2, Xinggang Wang1,📧

1 Huazhong University of Science and Technology, 2 Horizon Robotics; † Project lead; 📧 Corresponding author


📰 News

  • [2025.09.28] We have released core code for RL training.

  • [2025.09.18] RAD has been accepted by NeurIPS 2025! 🎉🎉🎉

  • [2025.02.18] We released our paper on arXiv. Code is coming soon. Please stay tuned! ☕️

🎯 How to Use

  • Project Structure
.
├── data/                        # Action anchors for planning/control
├── compute_advantage.py         # Script for computing RL advantages and evaluation metrics
├── generate_action_anchor.py    # Script for generating action anchors for planning/control
├── planning_head.py             # Planning head module
└── README.md
  • Run Key Scripts
# You can quickly test the core functionality by running the provided scripts.
# Generate action anchors
python generate_action_anchor.py

# Run the planning head module
python planning_head.py

# Compute advantage metrics
python compute_advantage.py
  • Using Your Own Data

To integrate this project into your pipeline and use your own data, follow these steps:

  1. Replace the Planning Head
    Use planning_head.py in place of the planning head of your end-to-end model (a minimal sketch of this step follows the list).

  2. Prepare the Closed-Loop Environment
    Set up your closed-loop environment and collect closed-loop data.

  3. Compute Advantages and Train the Model
    Use compute_advantage.py to calculate advantage values from the collected rollouts, then use them to train the model (a toy sketch of this step also follows the list).
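
A minimal sketch of step 1, assuming your end-to-end model exposes a fused scene feature and the action space is discretized over pre-computed anchors. The class name, feature dimension, and anchor count below are illustrative assumptions, not the repository's actual API; see planning_head.py for the real module.

# Hypothetical anchor-classification planning head (PyTorch sketch).
import torch
import torch.nn as nn

FEAT_DIM = 256      # assumed dimension of the fused scene feature
NUM_ANCHORS = 64    # assumed number of pre-computed action anchors

class AnchorPlanningHead(nn.Module):
    """Scores each action anchor given a fused scene feature."""
    def __init__(self, feat_dim: int = FEAT_DIM, num_anchors: int = NUM_ANCHORS):
        super().__init__()
        self.mlp = nn.Sequential(
            nn.Linear(feat_dim, feat_dim),
            nn.ReLU(inplace=True),
            nn.Linear(feat_dim, num_anchors),
        )

    def forward(self, scene_feat: torch.Tensor) -> torch.Tensor:
        # Per-anchor logits; a softmax over them defines the driving policy.
        return self.mlp(scene_feat)

# Attach in place of your model's original head, e.g.:
# model.planning_head = AnchorPlanningHead()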
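
A toy sketch of step 3: turning per-step closed-loop rewards into normalized advantages and feeding them to a REINFORCE-style update over the anchor distribution. The discounting and normalization here are common defaults assumed for illustration, not necessarily what compute_advantage.py implements.

import torch

def compute_advantages(rewards: torch.Tensor, gamma: float = 0.99) -> torch.Tensor:
    """Discounted returns over one rollout, normalized to zero mean / unit variance."""
    returns = torch.zeros_like(rewards)
    running = torch.zeros(())
    for t in reversed(range(rewards.shape[0])):
        running = rewards[t] + gamma * running
        returns[t] = running
    return (returns - returns.mean()) / (returns.std() + 1e-8)

def policy_gradient_loss(logits: torch.Tensor,      # (T, num_anchors) head output
                         actions: torch.Tensor,     # (T,) chosen anchor indices
                         advantages: torch.Tensor) -> torch.Tensor:
    # REINFORCE: maximize advantage-weighted log-probability of taken anchors.
    log_probs = torch.log_softmax(logits, dim=-1)
    chosen = log_probs.gather(1, actions.unsqueeze(1)).squeeze(1)
    return -(chosen * advantages).mean()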

📌 RAD Training Discussion & Reference

We have created a central discussion issue for RAD training details. You can view and participate in the discussion here: RAD Training Details Issue

📚 Citation

If you find RAD useful in your research or applications, please consider giving us a star 🌟 and citing it with the following BibTeX entry.

@article{RAD,
  title={RAD: Training an End-to-End Driving Policy via Large-Scale 3DGS-based Reinforcement Learning},
  author={Gao, Hao and Chen, Shaoyu and Jiang, Bo and Liao, Bencheng and Shi, Yiang and Guo, Xiaoyang and Pu, Yuechuan and Yin, Haoran and Li, Xiangyu and Zhang, Xinbang and Zhang, Ying and Liu, Wenyu and Zhang, Qian and Wang, Xinggang},
  journal={arXiv preprint arXiv:2502.13144},
  year={2025}
}
