Commit dad0388
committed
[Bugfix] Resolve MTP > 1 issue when lm head tp > 1
Previously, the dummy run executed compute_logits only once, regardless of num_speculative_tokens. This caused execute_model to hang on compute_logits when lm head tensor parallelism exceeded 1. The fix ensures compute_logits executes correctly during dummy run, matching num_speculative_tokens.
Signed-off-by: Jade Zheng <[email protected]>1 parent e985432 commit dad0388
File tree
3 files changed
+19
-9
lines changed- vllm_ascend
- spec_decode
- worker
3 files changed
+19
-9
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
136 | 136 | | |
137 | 137 | | |
138 | 138 | | |
139 | | - | |
| 139 | + | |
| 140 | + | |
140 | 141 | | |
141 | 142 | | |
142 | 143 | | |
| |||
148 | 149 | | |
149 | 150 | | |
150 | 151 | | |
| 152 | + | |
151 | 153 | | |
152 | 154 | | |
153 | 155 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
211 | 211 | | |
212 | 212 | | |
213 | 213 | | |
214 | | - | |
| 214 | + | |
| 215 | + | |
215 | 216 | | |
216 | 217 | | |
217 | 218 | | |
| |||
243 | 244 | | |
244 | 245 | | |
245 | 246 | | |
| 247 | + | |
246 | 248 | | |
247 | 249 | | |
248 | 250 | | |
| |||
665 | 667 | | |
666 | 668 | | |
667 | 669 | | |
| 670 | + | |
668 | 671 | | |
669 | 672 | | |
670 | 673 | | |
| |||
721 | 724 | | |
722 | 725 | | |
723 | 726 | | |
724 | | - | |
| 727 | + | |
725 | 728 | | |
726 | 729 | | |
727 | 730 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
3049 | 3049 | | |
3050 | 3050 | | |
3051 | 3051 | | |
| 3052 | + | |
| 3053 | + | |
| 3054 | + | |
| 3055 | + | |
| 3056 | + | |
| 3057 | + | |
| 3058 | + | |
| 3059 | + | |
3052 | 3060 | | |
3053 | 3061 | | |
3054 | 3062 | | |
| |||
3068 | 3076 | | |
3069 | 3077 | | |
3070 | 3078 | | |
3071 | | - | |
3072 | | - | |
| 3079 | + | |
3073 | 3080 | | |
3074 | 3081 | | |
3075 | 3082 | | |
| |||
3079 | 3086 | | |
3080 | 3087 | | |
3081 | 3088 | | |
3082 | | - | |
3083 | | - | |
3084 | | - | |
3085 | | - | |
| 3089 | + | |
| 3090 | + | |
3086 | 3091 | | |
3087 | 3092 | | |
3088 | 3093 | | |
| |||
0 commit comments