AMD ROCm™ Software

AMD ROCm software is AMD's open-source stack for GPU computation.

To learn more about ROCm, check out our Documentation, Examples, and Developer Hub.

If you have questions or need help, reach out to us on GitHub.

Popular repositories

  1. ROCm Public

    AMD ROCm™ Software - GitHub Home

    Shell · 5k stars · 405 forks

  2. HIP Public

    HIP: C++ Heterogeneous-Compute Interface for Portability

    C++ · 3.9k stars · 547 forks

  3. MIOpen Public

    AMD's Machine Intelligence Library

    Assembly · 1.1k stars · 242 forks

  4. tensorflow-upstream Public

    Forked from tensorflow/tensorflow

    TensorFlow ROCm port

    C++ · 690 stars · 97 forks

  5. HIPIFY Public

    HIPIFY: Convert CUDA to Portable C++ Code

    C++ · 553 stars · 79 forks

  6. ROCm-docker Public

    Dockerfiles for the various software layers defined in the ROCm software platform

    Shell · 452 stars · 70 forks

Repositories

Showing 10 of 303 repositories
  • composable_kernel Public

    Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators

    C++ · 350 stars · 149 forks · 24 issues (1 issue needs help) · 50 pull requests · Updated Feb 21, 2025
  • llvm-project Public Forked from llvm/llvm-project

    This is the AMD-maintained fork of the LLVM git repository. This repository accepts pull requests and issues related to AMD fork-specific topics (amd/*). For all other issues/PRs, please submit upstream at https://github.com/llvm/llvm-project.

    LLVM · 135 stars · 12,954 forks · 19 issues · 8 pull requests · Updated Feb 21, 2025
  • aiter Public

    AI Tensor Engine for ROCm

    Cuda · 23 stars · MIT license · 8 forks · 4 issues · 6 pull requests · Updated Feb 21, 2025
  • MIOpen Public

    AMD's Machine Intelligence Library

    Assembly · 1,117 stars · 242 forks · 248 issues (4 issues need help) · 73 pull requests · Updated Feb 21, 2025
  • hipBLASLt Public

    hipBLASLt is a library that provides general matrix-matrix operations with a flexible API and extends functionality beyond that of a traditional BLAS library.

    Assembly · 79 stars · MIT license · 105 forks · 9 issues · 76 pull requests · Updated Feb 21, 2025
  • pytorch Public Forked from pytorch/pytorch

    Tensors and dynamic neural networks in Python with strong GPU acceleration

    Python · 221 stars · 24,012 forks · 60 issues · 40 pull requests · Updated Feb 21, 2025
  • rocMLIR Public
    MLIR · 137 stars · 39 forks · 1 issue · 29 pull requests · Updated Feb 21, 2025
  • rocJPEG Public

    rocJPEG is a high-performance JPEG decode SDK for decoding JPEG images using the hardware-accelerated JPEG decoder on AMD GPUs.

    C++ · 3 stars · MIT license · 9 forks · 1 issue · 0 pull requests · Updated Feb 21, 2025
  • rocprofiler-compute Public

    Advanced Profiling and Analytics for AMD Hardware

    Python · 140 stars · 51 forks · 51 issues · 14 pull requests · Updated Feb 21, 2025
  • ROCR-Runtime Public

    ROCm Platform Runtime: ROCr, an HSA-based runtime enhanced for the HPC market

    C++ · 235 stars · 115 forks · 21 issues · 22 pull requests · Updated Feb 21, 2025