Pull requests: AMD-AGI/TraceLens
Add perf model for MambaSplitConv1dScanCombinedFn (Mamba-2 SSD)
perf_model
Add performance model for calculating TFLOPS/s and TB/s
#553
opened Mar 20, 2026 by
gphuang
1 task done
Add perf model and categorization for CK grouped GEMM, SSM, MoE, RoPE, and CrossEntropy ops
perf_model
#550
opened Mar 19, 2026 by
gphuang
1 task done
Add perf model coverage for TE FusedAttnFunc (SDPA)
perf_model
#549
opened Mar 19, 2026 by
gphuang
1 task done
Add perf model coverage for TE _Linear, _LayerNormLinear, and LayerNormFn ops
perf_model
#548
opened Mar 19, 2026 by
gphuang
1 task done
Create performance model for reduction operators
perf_model
#545
opened Mar 18, 2026 by
gabeweisz
Add perf model coverage for DeepEP EP communication ops
perf_model
#520
opened Mar 10, 2026 by
gphuang
Perform a second pass in unified table creation to find GPU events wi…
#511
opened Mar 6, 2026 by
devalshahamd
get_df_xla_perf raises KeyError for FP8 and s8 dtypes in XLA kernel operands
#447
opened Dec 12, 2025 by
brieflynn
Docs Update: Perf Model Add and Local Test Run Instructions
#389
opened Oct 16, 2025 by
spandoesai
Capture the overlapping events for every event and overall exposed computation time
#382
opened Oct 15, 2025 by
araina-amd