Skip to content
Change the repository type filter

All

    Repositories list

    • tensorflow_musa_extension

      Public
      C++
      10204Updated Feb 28, 2026Feb 28, 2026
    • Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels
      C++
      Other
      4594600Updated Feb 28, 2026Feb 28, 2026
    • torchada

      Public
      An adapter layer that ensures torch_musa🔦 delivers a CUDA-compatible PyTorch experience.
      Python
      MIT License
      32100Updated Feb 27, 2026Feb 27, 2026
    • Provides a Python interface to GPU management and monitoring functions. This is a wrapper around the MTML library.
      C
      MIT License
      3510Updated Feb 27, 2026Feb 27, 2026
    • LiteGS

      Public
      A refactored codebase for Gaussian Splatting. Training 3DGS in 50 seconds!
      Cuda
      Other
      2933340Updated Feb 26, 2026Feb 26, 2026
    • muAlg

      Public
      Cooperative primitives for CUDA C++. See https://github.com/NVIDIA/cccl
      Cuda
      BSD 3-Clause "New" or "Revised" License
      463600Updated Feb 11, 2026Feb 11, 2026
    • torch_musa

      Public
      torch_musa is an open source repository based on PyTorch, which can make full use of the super computing power of MooreThreads graphics cards.
      Python
      Other
      35478230Updated Feb 6, 2026Feb 6, 2026
    • MT-TransformerEngine

      Public
      A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better per…
      Python
      Apache License 2.0
      650901Updated Feb 5, 2026Feb 5, 2026
    • MT-MegatronLM

      Public
      Python
      21000Updated Feb 5, 2026Feb 5, 2026
    • pytorch3d

      Public
      PyTorch3D is FAIR's library of reusable components for deep learning with 3D data
      Python
      Other
      1.4k200Updated Feb 5, 2026Feb 5, 2026
    • mutlass

      Public
      MUSA Templates for Linear Algebra Subroutines
      C++
      Other
      1.7k4210Updated Jan 30, 2026Jan 30, 2026
    • tvm_musa

      Public
      Open Machine Learning Compiler Framework
      Python
      Apache License 2.0
      3.8k100Updated Jan 30, 2026Jan 30, 2026
    • gpu-compute-driver-bench

      Public
      C++
      Apache License 2.0
      2710Updated Jan 26, 2026Jan 26, 2026
    • mtai-sdk-ts

      Public
      TypeScript
      Other
      1101Updated Jan 23, 2026Jan 23, 2026
    • PyTorch media decoding and encoding
      Python
      BSD 3-Clause "New" or "Revised" License
      97100Updated Jan 22, 2026Jan 22, 2026
    • tutorial_on_musa

      Public
      Shell
      Other
      64461Updated Jan 13, 2026Jan 13, 2026
    • SimuMax

      Public
      a static analytical model for LLM distributed training
      Python
      Other
      1511720Updated Jan 8, 2026Jan 8, 2026
    • A nvImageCodec library of GPU- and CPU- accelerated codecs featuring a unified interface
      Jupyter Notebook
      Other
      13000Updated Jan 7, 2026Jan 7, 2026
    • Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.
      Python
      Other
      3.7k100Updated Jan 7, 2026Jan 7, 2026
    • StableGS

      Public
      Cuda
      0910Updated Jan 5, 2026Jan 5, 2026
    • FFmpeg

      Public
      Mirror of https://git.ffmpeg.org/ffmpeg.git
      C
      Other
      14k100Updated Dec 30, 2025Dec 30, 2025
    • vision

      Public
      Datasets, Transforms and Models specific to Computer Vision
      Python
      BSD 3-Clause "New" or "Revised" License
      7.2k000Updated Dec 29, 2025Dec 29, 2025
    • mate

      Public
      MUSA AI Tensor Engine
      C++
      Apache License 2.0
      0500Updated Dec 19, 2025Dec 19, 2025
    • kineto

      Public
      HTML
      Other
      3100Updated Nov 20, 2025Nov 20, 2025
    • AI_Agent

      Public
      Jupyter Notebook
      MIT License
      0000Updated Nov 17, 2025Nov 17, 2025
    • URPO

      Public
      0000Updated Nov 14, 2025Nov 14, 2025
    • pytorch_sparse

      Public
      PyTorch Extension Library of Optimized Autograd Sparse Matrix Operations
      Python
      MIT License
      159000Updated Oct 17, 2025Oct 17, 2025
    • PyTorch Extension Library of Optimized Scatter Operations
      Python
      MIT License
      207000Updated Oct 17, 2025Oct 17, 2025
    • Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models on MTGPU.
      Go
      Other
      15k3530Updated Oct 13, 2025Oct 13, 2025
    • PaddlePaddle custom device implementaion. (『飞桨』自定义硬件接入实现)
      C++
      Apache License 2.0
      212000Updated Sep 24, 2025Sep 24, 2025