[Bugfix] fixes the decoding metadata of dense mla's fp8 kvcache. #27144

sighingnow · 2025-10-18T06:11:51Z

Require the flashmla patch vllm-project/FlashMLA#7 to be landed first.

gemini-code-assist

Code Review

This pull request fixes an issue with decoding metadata for dense MLA's FP8 K/V cache by introducing a specialized operator. The changes in the Python code correctly route the execution to this new operator when appropriate. However, there is a critical issue in the CMake configuration where the flashmla dependency is pointed to a personal fork. This practice introduces significant risks and should be rectified by merging the required changes into the official upstream repository and updating the commit hash accordingly.

gemini-code-assist · 2025-10-18T06:12:25Z

cmake/external_projects/flashmla.cmake

+        GIT_REPOSITORY https://github.com/sighingnow/FlashMLA
+        GIT_TAG 7af725e6c2a3f0262e5b8573c715411a6d895cae


Pointing the GIT_REPOSITORY to a personal fork (sighingnow/FlashMLA) introduces a significant dependency risk. For project stability, security, and long-term maintainability, dependencies should point to official repositories. The required changes should be merged into the official vllm-project/FlashMLA repository first. Afterward, this pull request can be updated to use the new commit hash from the official repository.

GIT_REPOSITORY https://github.com/vllm-project/FlashMLA GIT_TAG <new_commit_hash_from_official_repo>

Signed-off-by: Tao He <[email protected]>

sighingnow · 2025-10-20T05:16:27Z

@LucasWilkinson could you please take a look? Thanks!

Signed-off-by: Lucas Wilkinson <[email protected]>

LucasWilkinson

LGTM; will run evals and post here

edit: gsm8k looks good: DeepSeek-V2-Lite-Chat with fp8 kv-cache, 64.6%

…m-project#27144) Signed-off-by: Tao He <[email protected]> Signed-off-by: Lucas Wilkinson <[email protected]> Co-authored-by: Lucas Wilkinson <[email protected]>

sighingnow requested a review from LucasWilkinson as a code owner October 18, 2025 06:11

gemini-code-assist bot reviewed Oct 18, 2025

View reviewed changes

mergify bot added ci/build v1 labels Oct 18, 2025

[Bugfix] fixes the decoding metadata of dense mla's fp8 kvcache.

aa99183

Signed-off-by: Tao He <[email protected]>

sighingnow force-pushed the fixes-dense-mla-fp8kv branch from 30a5293 to aa99183 Compare October 19, 2025 14:37

Merge branch 'main' into fixes-dense-mla-fp8kv

147fc5e

sighingnow and others added 2 commits October 21, 2025 22:56

Merge branch 'main' into fixes-dense-mla-fp8kv

3f3fb7a

update FlashMLA

bbe0b16

Signed-off-by: Lucas Wilkinson <[email protected]>

LucasWilkinson requested a review from pavanimajety as a code owner October 21, 2025 15:15

get rid of unrelated changes

fe47329

Signed-off-by: Lucas Wilkinson <[email protected]>

LucasWilkinson approved these changes Oct 21, 2025

View reviewed changes

LucasWilkinson enabled auto-merge (squash) October 21, 2025 16:17

github-actions bot added the ready ONLY add when PR is ready to merge/full CI is needed label Oct 21, 2025

LucasWilkinson merged commit 250fb1b into vllm-project:main Oct 21, 2025
88 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[Bugfix] fixes the decoding metadata of dense mla's fp8 kvcache. #27144

[Bugfix] fixes the decoding metadata of dense mla's fp8 kvcache. #27144

Uh oh!

sighingnow commented Oct 18, 2025

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

gemini-code-assist bot Oct 18, 2025

Uh oh!

sighingnow commented Oct 20, 2025

Uh oh!

LucasWilkinson left a comment •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

		GIT_REPOSITORY https://github.com/sighingnow/FlashMLA
		GIT_TAG 7af725e6c2a3f0262e5b8573c715411a6d895cae

Uh oh!

[Bugfix] fixes the decoding metadata of dense mla's fp8 kvcache. #27144

[Bugfix] fixes the decoding metadata of dense mla's fp8 kvcache. #27144

Uh oh!

Conversation

sighingnow commented Oct 18, 2025

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist bot Oct 18, 2025

Choose a reason for hiding this comment

Uh oh!

sighingnow commented Oct 20, 2025

Uh oh!

LucasWilkinson left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

LucasWilkinson left a comment •

edited

Loading