What to build
Add router-visible identity metadata for RDMA-local prefill/decode (PD) Engine/Decoder pairs. Once topology-aware placement has selected a topology domain, OME should label or annotate the Engine and Decoder pods so the router can discover matched prefill/decode pairs and avoid pairing an Engine in one rack or RDMA fabric with a Decoder in another.
The router-facing contract should keep placement and routing concerns separate: scheduling decides where pods land, while the router consumes stable pair and topology metadata to pick a local pair for prefill/decode work. The contract should work with the current service-discovery selector flows and leave room for future topology-aware routing or KV-transfer policies.
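As a rough illustration of the router side of this contract, the sketch below groups pods by a topology-domain label and only yields Engine/Decoder pairs that share a domain. The label keys (`example.com/pd-role`, `example.com/topology-domain`), the `Pod` type, and the `local_pairs` helper are all hypothetical placeholders, not names defined by OME or the router; the real keys and whether they are labels or annotations are still to be decided.

```python
from collections import defaultdict
from dataclasses import dataclass, field

# Hypothetical label keys, for illustration only; OME does not define these.
ROLE_LABEL = "example.com/pd-role"            # "engine" or "decoder"
DOMAIN_LABEL = "example.com/topology-domain"  # e.g. a rack or RDMA fabric ID


@dataclass
class Pod:
    """Minimal stand-in for the pod metadata a router would discover."""
    name: str
    labels: dict = field(default_factory=dict)


def local_pairs(pods):
    """Return (domain, engine, decoder) tuples whose members share a domain.

    Pods missing either label are skipped, so a pod that was placed without
    topology metadata is never offered as part of a pair.
    """
    by_domain = defaultdict(lambda: {"engine": [], "decoder": []})
    for pod in pods:
        role = pod.labels.get(ROLE_LABEL)
        domain = pod.labels.get(DOMAIN_LABEL)
        if role in ("engine", "decoder") and domain:
            by_domain[domain][role].append(pod.name)

    pairs = []
    for domain in sorted(by_domain):
        group = by_domain[domain]
        # Only pods in the same domain are combined; cross-domain pairs
        # are never generated.
        for engine in sorted(group["engine"]):
            for decoder in sorted(group["decoder"]):
                pairs.append((domain, engine, decoder))
    return pairs
```

In this sketch the router never reads scheduling state directly; it depends only on the two labels, which matches the separation between placement and routing described above.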
Acceptance criteria
Blocked by