[CVPR 2026] SocialNav: Training Human-Inspired Foundation Model for Socially-Aware Embodied Navigation

🎉🎉CVPR 2026 Oral🎉🎉

This is the official repository for SocialNav, a foundational model for socially-aware embodied navigation with a hierarchical brain–action architecture. SocialNav unifies high-level social norm understanding with low-level, socially compliant trajectory generation.

📢 Note: Pre-trained models (Hugging Face) and the CityWalker benchmark evaluation script are available in this repository.

🌐 Project Page

For an overview of the project, figures, and teaser video, please visit the project page:

👉 Project Page: https://amap-eai.github.io/SocialNav/

🔍 Overview

SocialNav is designed to address socially-aware navigation in real-world environments by:

Combining a VLM-based Brain for high-level semantic and social reasoning
With a flow-based Action Expert for low-level trajectory generation
Training on the large-scale SocNav Dataset (7M samples) and evaluating on the SocNav Benchmark

Key components include:

SocNav Dataset
- Expert Trajectories Pyramid (ETP)
- Cognitive Activation Dataset (CAD)
SocNav Benchmark
- High-fidelity evaluation built on Isaac Sim + 3DGS
- 9 large-scale social scenes (parks, streets, offices, campus)
- Metrics for both navigation performance and social compliance

🤗 Models

SAFE-GRPO checkpoints:

Backbone	ModelScope	Hugging Face
Qwen2-VL	SocialNav-Qwen2-VL-SAFE-GRPO	SocialNav-Qwen2-VL-SAFE-GRPO
Qwen2.5-VL	SocialNav-Qwen2.5-VL-SAFE-GRPO	SocialNav-Qwen2.5-VL-SAFE-GRPO
Qwen3-VL	Coming soon	Coming soon

📦 Installation

Requirements

OS: Linux (recommended) or macOS; GPU inference requires NVIDIA CUDA
Python: 3.10 / 3.11 (match requirements.txt)
CUDA: Version compatible with your PyTorch wheels (e.g. cu12)

1. Clone

git clone https://github.com/AMAP-EAI/SocialNav.git
cd SocialNav

2. Virtual environment (optional)

python -m venv .venv
source .venv/bin/activate

3. PyTorch

Install from pytorch.org for your CUDA version, e.g.:

pip install torch torchvision --index-url https://download.pytorch.org/whl/cu124

4. Dependencies

pip install -r requirements.txt

For flow-matching components, if missing from requirements.txt:

pip install torchcfm diffusers

5. Local Transformers (required)

This repo ships a patched Qwen3-VL + Flow Matching tree under transformers/. Install in editable mode:

pip install -e "./transformers[dev]"   # or: pip install -e ./transformers

6. `PYTHONPATH`

If modeling_qwen3_vl.py imports src.train.sde_with_logprob, run from the repo root or set:

export PYTHONPATH="/path/to/SocialNav:${PYTHONPATH}"

7. Flash Attention (optional)

Install flash-attn-2 per Qwen3-VL docs if you need lower memory; otherwise PyTorch SDPA is fine.

📊 Evaluation

Script: utils/citywalker.py
Primary metric: mean_angle in metrics_citywalker_qwen3.csv. Per sample, the script takes the maximum over five steps of the angle (degrees) between predicted and GT waypoint vectors; mean_angle in the CSV is the mean of that value over included samples (by row: categories, overall, and mean). Implementation: compute_sample_metrics and the mean_angle lists in main.

Input (jsonl, one record per line):

Field	Description
`images`	List of local image paths
`messages[0].content`	User text
`messages[1].gt_waypoints`	`(5, 2)`
`messages[1].input_waypoints`	`(6, 2)`
`messages[1].step_scale`	`float`
`messages[1].arrive`	`[0]` or `[1]`
`messages[1].categories`	Aligned with `TEST_CATEGORIES` in the script

Run: Set MODEL_PATH, DATA_PATH, and DEVICE at the top of the script, then:

cd /path/to/SocialNav
export PYTHONPATH="$(pwd):${PYTHONPATH}"
CUDA_VISIBLE_DEVICES=0 python utils/citywalker.py

Outputs: pred_citywalker_qwen3.jsonl, metrics_citywalker_qwen3.csv (default under MODEL_PATH/infer_result_citywalker_qwen3_fast_step_5/; see OUTPUT_DIR in the script).

📝 Citation

If you find this project useful in your research, please consider citing (to appear at CVPR 2026):

@article{chen2025socialnav,
      title={SocialNav: Training Human-Inspired Foundation Model for Socially-Aware Embodied Navigation},
      author={Ziyi Chen and Yingnan Guo and Zedong Chu and Minghua Luo and Yanfen Shen and Mingchao Sun and Junjun Hu and Shichao Xie and Kuan Yang and Pei Shi and Zhining Gu and Lu Liu and Honglin Han and Xiaolong Wu and Mu Xu and Yu Zhang},
      journal={arXiv preprint arXiv:2511.21135},
      year={2025}
}

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
.vscode		.vscode
scripts		scripts
src		src
transformers		transformers
utils		utils
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
environment.yaml		environment.yaml
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

[CVPR 2026] SocialNav: Training Human-Inspired Foundation Model for Socially-Aware Embodied Navigation

🌐 Project Page

🔍 Overview

🤗 Models

📦 Installation

Requirements

1. Clone

2. Virtual environment (optional)

3. PyTorch

4. Dependencies

5. Local Transformers (required)

6. `PYTHONPATH`

7. Flash Attention (optional)

📊 Evaluation

📝 Citation

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

[CVPR 2026] SocialNav: Training Human-Inspired Foundation Model for Socially-Aware Embodied Navigation

🌐 Project Page

🔍 Overview

🤗 Models

📦 Installation

Requirements

1. Clone

2. Virtual environment (optional)

3. PyTorch

4. Dependencies

5. Local Transformers (required)

6. PYTHONPATH

7. Flash Attention (optional)

📊 Evaluation

📝 Citation

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

6. `PYTHONPATH`

Packages