    Repositories list

    • This is the AMD-maintained fork of the LLVM git repository. This repository accepts pull requests and issues related to AMD fork-specific topics (amd/*). For all other issues/PRs, please submit upstream at https://github.com/llvm/llvm-project.
      LLVM
      15k1762712 Updated Oct 4, 2025
    • Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators
      C++
      2394693399 Updated Oct 4, 2025
    • Go
      35741310 Updated Oct 4, 2025
    • Super repo for ROCm systems projects
      C++
      46103136416 Updated Oct 4, 2025
    • TheRock

      Public
      The HIP Environment and ROCm Kit - A lightweight open source build system for HIP and ROCm
      Python
      9443022569 Updated Oct 4, 2025
    • aiter

      Public
      AI Tensor Engine for ROCm
      Python
      1142835296 Updated Oct 4, 2025
    • Monorepo for ROCm libraries
      Assembly
      137132141319 Updated Oct 4, 2025
    • Fast and memory-efficient exact attention
      Python
      2k192111 Updated Oct 4, 2025
    • builder

      Public
      Continuous builder and binary build scripts for pytorch
      Shell
      230305 Updated Oct 4, 2025
    • iris

      Public
      AMD RAD's multi-GPU Triton-based framework for seamless multi-GPU programming
      Python
      17802918 Updated Oct 4, 2025
    • Device Metrics Exporter exports metrics from AMD devices (GPUs) to collectors like Prometheus.
      C++
      232774 Updated Oct 4, 2025
    • rocMLIR

      Public
      MLIR
      49150342 Updated Oct 4, 2025
    • rocDecode

      Public
      rocDecode is a high-performance video decode SDK for AMD hardware
      C++
      243071 Updated Oct 4, 2025
    • [DEPRECATED] Moved to ROCm/rocm-systems repo
      C++
      202701 Updated Oct 4, 2025
    • AMD's graph optimization engine.
      C++
      10925223766 Updated Oct 4, 2025
    • maxtext

      Public
      A simple, performant and scalable Jax LLM!
      Python
      418001 Updated Oct 4, 2025
    • aotriton

      Public
      Ahead of Time (AOT) Triton Math Library
      Python
      3077181 Updated Oct 4, 2025
    • vllm

      Public
      A high-throughput and memory-efficient inference and serving engine for LLMs
      Python
      11k103533 Updated Oct 3, 2025
    • triton

      Public
      Development repository for the Triton language and compiler
      Python
      2.3k131869 Updated Oct 3, 2025
    • ROCm Documentation Python package for ReadTheDocs build standardization
      CSS
      2016913 Updated Oct 3, 2025
    • C++
      1100 Updated Oct 3, 2025
    • FlashInfer+ROCm: ROCm port of FlashInfer
      Cuda
      529211 Updated Oct 3, 2025
    • [DEPRECATED] Moved to ROCm/rocm-libraries repo
      C++
      2310 Updated Oct 3, 2025
    • ONNX Runtime: cross-platform, high performance scoring engine for ML models
      C++
      3.5k605 Updated Oct 3, 2025
    • pytorch

      Public
      Tensors and Dynamic neural networks in Python with strong GPU acceleration
      Python
      25k2435966 Updated Oct 3, 2025
    • hipDNN

      Public
      [DEPRECATED] Moved to ROCm/rocm-libraries repo
      C++
      195510 Updated Oct 3, 2025
    • xla

      Public
      A machine learning compiler for GPUs, CPUs, and ML accelerators
      C++
      6534058 Updated Oct 3, 2025
    • rocJPEG

      Public
      rocJPEG is a high-performance JPEG decode SDK for decoding JPEG images using a hardware-accelerated JPEG decoder on AMD’s GPUs.
      C++
      13720 Updated Oct 3, 2025
    • rocm-jax

      Public
      Python
      14511 Updated Oct 3, 2025
    • Python
      2246815 Updated Oct 3, 2025