Skip to content

Pull requests: ROCm/triton

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Update FlashAttention transV scripts
#766 opened Mar 21, 2025 by binarman Loading…
Add v2 test to paged_attention_decode
#764 opened Mar 20, 2025 by rahulbatra85 Loading…
MLA prefill, forward_normal benchmark
#750 opened Mar 7, 2025 by Chi-Chu319 Loading…
Cap warp count to 16 for devices with warp size 64
#747 opened Mar 5, 2025 by schung-amd Draft
4 of 7 tasks
Add int4 quantization support to MoE
#715 opened Jan 28, 2025 by rahulbatra85 Loading…
Fused moe gemm + silu activation kernel
#710 opened Jan 23, 2025 by Chi-Chu319 Loading…
4 of 6 tasks
Layernorm changes
#681 opened Dec 12, 2024 by vgokhale Loading…
Added CK-gemm runner
#674 opened Dec 6, 2024 by ravil-mobile Loading…
Perf Kernels benchmark workflow
#651 opened Oct 29, 2024 by NISHIY-EKSDEE Draft
Use mask during load for Softmax
#645 opened Sep 24, 2024 by rahulbatra85 Loading…
RMSNorm Blocked Implementation
#638 opened Sep 12, 2024 by rahulbatra85 Loading…
Add INT4 quant/de-quant kernels
#620 opened Jul 29, 2024 by rahulbatra85 Loading…
[CODE SHARING] Ravil/sched inst
#611 opened Jul 10, 2024 by ravil-mobile Draft
ProTip! Type g i on any issue or pull request to go back to the issue listing page.