Skip to content

Pull requests: lightseekorg/tokenspeed

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

fix(sampling): fused_topk_topp PDL race causing IMA
#536 opened Jun 26, 2026 by jaywme Collaborator Loading…
Fix gathered MXFP4 activation scales in Gluon MoE
#534 opened Jun 26, 2026 by qedawkins Contributor Loading…
test: glm-5.2 agentic bench
#532 opened Jun 26, 2026 by syuoni Member Draft
[WIP] Initial glm 5.2 support on amd
#528 opened Jun 26, 2026 by borontion Contributor Draft
fix(spec): drive EAGLE3 aux capture layers from the draft config
#526 opened Jun 26, 2026 by jaywme Collaborator Loading…
feat(log): add --log-request-stats per-request statistics logging
#512 opened Jun 24, 2026 by LorrinWWW Contributor Loading…
[WIP] feat:support qwen3.5 dflash
#510 opened Jun 24, 2026 by minedec Contributor Draft
test(agentic): add EvalScope trie benchmark protocol
#466 opened Jun 17, 2026 by Xiangyi1996 Collaborator Draft
test(ci): add DeepSeek-V4-Flash MTP AIME25 eval
#461 opened Jun 16, 2026 by dongjiyingdjy Contributor Loading…
test: add dp4ep4 case in CI
#453 opened Jun 15, 2026 by tuanzhangCS Contributor Draft
[WIP] Refactor Cache Management
#447 opened Jun 15, 2026 by wangbo981016 Contributor Draft
Fix EP8 DP/TP RSAG init and empty LM head
#416 opened Jun 11, 2026 by yubofredwang Contributor Loading…
perf(gdn): fuse causal_conv1d and QKV split for GDN prefill
#382 opened Jun 8, 2026 by elwhyjay Contributor Loading…
Add Triton sampling backends alongside FlashInfer inactive
#280 opened May 27, 2026 by FlamingoPg Contributor Loading…
feat(trtllm-MHA): support mixed prefill/decode batches
#176 opened May 18, 2026 by rjzhb Collaborator Draft
4 tasks done
ProTip! What’s not been updated in a month: updated:<2026-05-26.