chore: pin vllm-rs ref and report decode stats #461

Merged
gufengc merged 5 commits into main from codex/vllm-rs-version-sidecar on May 29, 2026

Conversation

@gufengc
Collaborator

@gufengc gufengc commented May 29, 2026

Summary

  • Pin the default vLLM source ref used by install.sh for building vllm-rs to v0.22.0.
  • Write a vllm-rs.version sidecar next to the installed binary after a successful build.
  • Rebuild vllm-rs when an existing binary is missing the sidecar or the sidecar does not match the requested VLLM_REF; skip only when the recorded version matches.
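The skip-or-rebuild rule described above can be sketched in bash roughly as follows. This is a minimal illustration, not the code in install.sh: the function names `needs_rebuild` and `record_version` are hypothetical, and the sketch assumes the sidecar holds nothing but the ref string on a single line.

```shell
#!/usr/bin/env bash
# Hypothetical sketch of the sidecar check; names and file format are
# assumptions for illustration, not taken from install.sh.

# Default ref, overridable from the environment.
VLLM_REF="${VLLM_REF:-v0.22.0}"

# needs_rebuild BIN SIDECAR REF
# Returns 0 (rebuild) when the binary is missing, the sidecar is missing,
# or the recorded ref differs from REF. Returns 1 (skip) only on a match.
needs_rebuild() {
  local bin="$1" sidecar="$2" ref="$3"
  [[ -x "$bin" ]] || return 0
  [[ -f "$sidecar" ]] || return 0
  local recorded
  recorded="$(<"$sidecar")"   # command substitution strips the trailing newline
  [[ "$recorded" != "$ref" ]]
}

# record_version SIDECAR REF
# Writes the ref next to the binary; call this only after a successful build.
record_version() {
  local sidecar="$1" ref="$2"
  printf '%s\n' "$ref" > "$sidecar"
}
```

An installer could then wrap its build step as `if needs_rebuild .venv/bin/vllm-rs .venv/bin/vllm-rs.version "$VLLM_REF"; then ...build...; record_version .venv/bin/vllm-rs.version "$VLLM_REF"; fi`. Writing the sidecar only after the build succeeds means a failed or interrupted build leaves no matching record, so the next run rebuilds.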

Why

The installer previously skipped rebuilding whenever .venv/bin/vllm-rs already existed, regardless of which vLLM ref it came from. That made it easy to keep using an older or different binary after changing VLLM_REF.

Validation

  • bash -n install.sh
  • ./install.sh --extras mac to rebuild vllm-rs from v0.22.0
  • .venv/bin/vllm-rs --help
  • .venv/bin/vllm-rs frontend --help
  • .venv/bin/python -m pytest tests/test_vllm_rust_frontend.py tests/test_engine_core_protocol.py tests/test_backend_request_handler.py
  • .venv/bin/pre-commit run --all-files
  • .venv/bin/pytest (125 passed, 5 skipped)

@gufengc gufengc marked this pull request as ready for review May 29, 2026 15:56
@gufengc gufengc changed the title from "[codex] Pin vllm-rs install ref" to "[codex] Pin vllm-rs ref and report decode stats" May 29, 2026
@gufengc gufengc requested a review from a team May 29, 2026 15:56
@gufengc gufengc changed the title from "[codex] Pin vllm-rs ref and report decode stats" to "chore: pin vllm-rs ref and report decode stats" May 29, 2026
@gufengc gufengc merged commit 5b2578e into main May 29, 2026
11 checks passed
@gufengc gufengc deleted the codex/vllm-rs-version-sidecar branch May 29, 2026 16:32
