[WIP - No Merge] Refactor DrivePolicy architecture and configuration by vcharraut · Pull Request #470 · Emerge-Lab/PufferDrive

vcharraut · 2026-06-02T20:29:47Z

Updated DrivePolicy to use shared network architecture instead of split network.
Changed input size parameters to specific sizes for ego, partner, lane, boundary, traffic control, and conditioning inputs.
Modified encoder configuration to include activation functions and layer normalization options.
Removed gigaflow architecture in favor of a more flexible encoder design.
Adjusted observation size calculations to include counts of various features.
Updated environment bindings and configuration files to reflect new parameter names and structures.
Enhanced the DriveBackbone class to support new encoder configurations and pooling mechanisms.
Updated the Drive class to accommodate changes in the backbone initialization and observation encoding.


Copilot

Pull request overview

Refactors the DrivePolicy/DriveBackbone architecture to support per-input encoders with configurable activations/layer-norm, introduces explicit masking of padded slots via appended per-layer “valid counts”, and updates environment/config/notebook wiring to match the new observation layout and policy kwargs.

Changes:

Replace the prior (gigaflow vs standard) encoder logic with a unified encoder design supporting configurable activation + optional LayerNorm, and switch to a shared-network actor/critic option.
Extend the Drive observation vector with 4 appended count features (lane/boundary/partner/traffic) and use these counts for optional padding-masking during pooling.
Update drive.ini defaults and notebooks/utilities to use the new policy parameter names and updated observation semantics.
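The count-based padding mask described above can be sketched in a few lines. This is only an illustration under stated assumptions, not the PR's implementation: it uses plain Python lists instead of tensors, assumes max pooling, assumes valid slots are packed at the front of each layer, and falls back to zeros when a layer has no valid slots. The name `masked_max_pool` is hypothetical.

```python
from typing import List, Sequence


def masked_max_pool(slots: Sequence[Sequence[float]], valid_count: int) -> List[float]:
    """Max-pool each feature over the first `valid_count` slots only.

    Assumption: valid slots come first and padding fills the remaining rows.
    """
    if not slots:
        raise ValueError("slots must contain at least one row")
    width = len(slots[0])
    # Clamp the count so a bad value cannot index past the buffer.
    n = max(0, min(int(valid_count), len(slots)))
    if n == 0:
        # Assumed fallback: an all-padding layer pools to zeros.
        return [0.0] * width
    return [max(row[j] for row in slots[:n]) for j in range(width)]


# Three lane slots; only the first two hold real data, the third is padding.
lane_embeddings = [
    [0.2, -1.0],
    [0.5, -3.0],
    [9.0, 9.0],
]
pooled = masked_max_pool(lane_embeddings, valid_count=2)
unmasked = masked_max_pool(lane_embeddings, valid_count=3)
print(pooled, unmasked)
```

Without the mask, the padded row dominates the pooled result, which is the failure mode the appended counts are meant to avoid.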

Reviewed changes

Copilot reviewed 9 out of 9 changed files in this pull request and generated 4 comments.

Show a summary per file

| File | Description |
| --- | --- |
| pufferlib/ocean/torch.py | Implements the new DriveBackbone/Drive policy wiring: per-encoder widths, activation/LN config, masking-aware pooling, and shared-network actor/critic behavior. |
| pufferlib/ocean/env_binding.h | Exposes `OBS_COUNT_FEATURES` to Python so Python-side slicing/masking can match the C observation layout. |
| pufferlib/ocean/drive/drive.py | Updates Python Drive env observation sizing and config surface to include appended count features and shared-network flag. |
| pufferlib/ocean/drive/drive.h | Updates C-side observation layout (partner features + appended counts) and writes the per-layer slot counts into the observation buffer. |
| pufferlib/config/ocean/drive.ini | Renames/remaps policy parameters to the new per-encoder sizes + activation/LN options; switches to `shared_network`. |
| notebooks/notebook_utils.py | Updates notebook policy defaults to the new kwargs. |
| notebooks/06_architecture.ipynb | Refreshes architecture visualization/benchmarking code to the new encoder/backbone configuration surface. |
| notebooks/05_inference.ipynb | Updates observation documentation/visuals for the new partner feature set. |
| notebooks/01_observations.ipynb | Updates manual slicing checks to account for the 4 appended features at the end of the observation vector. |
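The manual slicing that the notebooks need to account for might look like the sketch below. It assumes the four counts sit at the very end of a flat observation vector, in the order lane, boundary, partner, traffic (the order the overview lists them in, which may not match the C layout), and that they are stored as raw integers encoded as floats. `NUM_COUNT_FEATURES`, `COUNT_NAMES`, and `split_observation` are hypothetical stand-ins, not names from the repository.

```python
from typing import Dict, List, Sequence, Tuple

# Stand-in for the constant the C binding exposes to Python.
NUM_COUNT_FEATURES = 4
# Assumed order of the appended counts; check drive.h for the real layout.
COUNT_NAMES = ("lane", "boundary", "partner", "traffic")


def split_observation(obs: Sequence[float]) -> Tuple[List[float], Dict[str, int]]:
    """Separate the feature block from the trailing per-layer slot counts."""
    if len(obs) < NUM_COUNT_FEATURES:
        raise ValueError("observation is shorter than the appended count block")
    features = list(obs[:-NUM_COUNT_FEATURES])
    tail = obs[-NUM_COUNT_FEATURES:]
    counts = {name: int(value) for name, value in zip(COUNT_NAMES, tail)}
    return features, counts


# Toy observation: five feature values followed by the four counts.
obs = [0.1, 0.2, 0.3, 0.4, 0.5, 3.0, 1.0, 2.0, 0.0]
features, counts = split_observation(obs)
print(len(features), counts)
```

Any check that previously assumed the feature block ran to the end of the vector would need to stop four elements earlier under this layout.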



…rchitecture, and utility files
- Updated references from `conditioning_dim` to `target_dim` in training and inference notebooks.
- Changed `conditioning_input_size` to `target_input_size` in configuration files and utility scripts.
- Adjusted encoder creation and usage in the DriveBackbone class to reflect the new target terminology.
- Ensured consistency across all relevant files to improve clarity and maintainability.
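For configs or notebooks written against the old names, a small migration shim could apply the renames mechanically. The sketch below is hypothetical and not part of the PR: the helper `migrate_policy_kwargs` is invented for illustration, and the mapping only includes renames named in this conversation (`mask_padded_observations` to `mask_padded_features` comes from a commit title in this PR).

```python
from typing import Any, Dict

# Old name -> new name, taken from the commit messages in this PR.
RENAMED_KEYS = {
    "conditioning_input_size": "target_input_size",
    "conditioning_dim": "target_dim",
    "mask_padded_observations": "mask_padded_features",
}


def migrate_policy_kwargs(old: Dict[str, Any]) -> Dict[str, Any]:
    """Return a copy of `old` with legacy keys renamed; other keys pass through."""
    new: Dict[str, Any] = {}
    for key, value in old.items():
        new_key = RENAMED_KEYS.get(key, key)
        if new_key in new:
            # Both the legacy and the new spelling were supplied.
            raise ValueError(f"duplicate setting for {new_key!r}")
        new[new_key] = value
    return new


legacy = {"conditioning_input_size": 3, "shared_network": True}
migrated = migrate_policy_kwargs(legacy)
print(migrated)
```

Raising on a duplicate rather than silently picking one value keeps a half-migrated config from hiding a conflict.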

Copilot AI review requested due to automatic review settings June 2, 2026 20:29

Copilot started reviewing on behalf of vcharraut June 2, 2026 20:29

Copilot AI reviewed Jun 2, 2026


Comment thread pufferlib/ocean/torch.py

Comment thread pufferlib/ocean/torch.py

Comment thread pufferlib/ocean/drive/drive.h

Comment thread notebooks/01_observations.ipynb Outdated

vcharraut and others added 5 commits June 2, 2026 23:32

Update tests

a9952e5

Potential fix for pull request finding

212fd0e

Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com>

Rename mask_padded_observations to mask_padded_features for consistency

b3c91f0

Rename OBS_COUNT_FEATURES to OBS_SLOT_NUM_TYPES for consistency across the codebase

7186131

vcharraut changed the title ~~Refactor DrivePolicy architecture and configuration~~ [WIP - No Merge] Refactor DrivePolicy architecture and configuration Jun 2, 2026

vcharraut wants to merge 6 commits into emerge/temp_training from vcha/encoders


2 participants
