    Repositories list

    • Megatron-LM

      Public
      Ongoing research training transformer models at scale
      Python
      3.4k14k330239 Updated Dec 7, 2025
    • tilus

      Public
      Tilus is a tile-level kernel programming language with explicit control over shared memory and registers.
      Python
      1241480 Updated Dec 7, 2025
    • KAI-Scheduler

      Public
      KAI Scheduler is an open source Kubernetes Native scheduler for AI workloads at large scale
      Go
      1149562336 Updated Dec 7, 2025
    • torch-harmonics

      Public
      Differentiable signal processing on the sphere for PyTorch
      Jupyter Notebook
      6261034 Updated Dec 7, 2025
    • aistore

      Public
      AIStore: scalable storage for AI applications
      Go
      2281.7k00 Updated Dec 7, 2025
    • Model-Optimizer

      Public
      A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM, TensorRT, vLLM, etc. to optimize inference speed.
      Python
      2061.6k6844 Updated Dec 7, 2025
    • cuda-python

      Public
      CUDA Python: Performance meets Productivity
      Python
      2273.1k20515 Updated Dec 7, 2025
    • nvidia-resiliency-ext

      Public
      NVIDIA Resiliency Extension is a Python package for framework developers and users to implement fault-tolerant features. It improves the effective training time by minimizing the downtime due to failures and interruptions.
      Python
      37239115 Updated Dec 7, 2025
    • accelerated-computing-hub

      Public
      NVIDIA curated collection of educational resources related to general purpose GPU programming.
      Jupyter Notebook
      167926134 Updated Dec 7, 2025
    • Fuser

      Public
      A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")
      C++
      69364205212 Updated Dec 7, 2025
    • TensorRT-LLM

      Public
      TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in a performant way.
      Python
      1.9k12k618453 Updated Dec 7, 2025
    • k8s-dra-driver-gpu

      Public
      NVIDIA DRA Driver for GPUs
      Go
      1015049427 Updated Dec 7, 2025
    • Experimental projects related to TensorRT
      MLIR
      191163712 Updated Dec 7, 2025
    • k8s-nim-operator

      Public
      An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment.
      Go
      34140729 Updated Dec 7, 2025
    • NV-Kernels

      Public
      Ubuntu kernels which are optimized for NVIDIA server systems
      C
      4370011 Updated Dec 7, 2025
    • JAX-Toolbox

      Public
      JAX-Toolbox
      Python
      673678043 Updated Dec 7, 2025
    • TileGym

      Public
      Helpful kernel tutorials and examples for tile-based GPU programming
      Python
      925200 Updated Dec 7, 2025
    • go-gpuallocator

      Public
      Go Abstraction for Allocating NVIDIA GPUs with Custom Policies
      Go
      2611957 Updated Dec 7, 2025
    • nvidia-container-toolkit

      Public
      Build and run containers leveraging NVIDIA GPUs
      Go
      4463.9k11723 Updated Dec 7, 2025
    • mig-parted

      Public
      MIG Partition Editor for NVIDIA GPUs
      Go
      522312216 Updated Dec 7, 2025
    • NVSentinel

      Public
      NVSentinel is a cross-platform fault remediation service designed to rapidly remediate runtime node-level issues in GPU-accelerated computing environments
      Go
      2599407 Updated Dec 7, 2025
    • k8s-device-plugin

      Public
      NVIDIA device plugin for Kubernetes
      Go
      7583.6k7534 Updated Dec 7, 2025
    • cuCollections

      Public
      C++
      1015985513 Updated Dec 7, 2025
    • gpu-operator

      Public
      NVIDIA GPU Operator creates, configures, and manages GPUs in Kubernetes
      Go
      4202.4k9467 Updated Dec 7, 2025
    • cuopt

      Public
      GPU accelerated decision optimization
      Cuda
      1006017226 Updated Dec 7, 2025
    • cloud-native-docs

      Public
      Documentation repository for NVIDIA Cloud Native Technologies
      PowerShell
      3331510 Updated Dec 7, 2025
    • vgpu-device-manager

      Public
      NVIDIA vGPU Device Manager manages NVIDIA vGPU devices on top of Kubernetes
      Go
      22152016 Updated Dec 7, 2025
    • stdexec

      Public
      `std::execution`, the proposed C++ framework for asynchronous and parallel programming.
      C++
      2172.1k11211 Updated Dec 6, 2025
    • nvbench

      Public
      CUDA Kernel Benchmarking Library
      Cuda
      94773538 Updated Dec 6, 2025
    • cccl

      Public
      CUDA Core Compute Libraries
      C++
      2972.1k1.1k195 Updated Dec 6, 2025