Skip to content
@NVIDIA

NVIDIA Corporation

Pinned Loading

  1. cuopt cuopt Public

    GPU accelerated decision optimization

    Cuda 580 97

  2. cuopt-examples cuopt-examples Public

    NVIDIA cuOpt examples for decision optimization

    Jupyter Notebook 384 61

  3. open-gpu-kernel-modules open-gpu-kernel-modules Public

    NVIDIA Linux open GPU kernel module source

    C 16.4k 1.5k

  4. aistore aistore Public

    AIStore: scalable storage for AI applications

    Go 1.6k 225

  5. nvidia-container-toolkit nvidia-container-toolkit Public

    Build and run containers leveraging NVIDIA GPUs

    Go 3.9k 442

  6. GenerativeAIExamples GenerativeAIExamples Public

    Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.

    Jupyter Notebook 3.6k 923

Repositories

Showing 10 of 634 repositories
  • NVSentinel Public

    NVSentinel is a cross-platform fault remediation service designed to rapidly remediate runtime node-level issues in GPU-accelerated computing environments

    NVIDIA/NVSentinel’s past year of commit activity
    Go 87 Apache-2.0 21 43 9 Updated Dec 1, 2025
  • cccl Public

    CUDA Core Compute Libraries

    NVIDIA/cccl’s past year of commit activity
  • cuda-quantum Public

    C++ and Python support for the CUDA Quantum programming model for heterogeneous quantum-classical workflows

    NVIDIA/cuda-quantum’s past year of commit activity
    C++ 862 306 413 (16 issues need help) 86 Updated Dec 1, 2025
  • TensorRT-Model-Optimizer Public

    A unified library of state-of-the-art model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM or TensorRT to optimize inference speed.

    NVIDIA/TensorRT-Model-Optimizer’s past year of commit activity
    Python 1,592 Apache-2.0 204 73 42 Updated Dec 1, 2025
  • spark-rapids Public

    Spark RAPIDS plugin - accelerate Apache Spark with GPUs

    NVIDIA/spark-rapids’s past year of commit activity
    Scala 950 Apache-2.0 264 1,755 (47 issues need help) 31 Updated Dec 1, 2025
  • TensorRT-LLM Public

    TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in a performant way.

    NVIDIA/TensorRT-LLM’s past year of commit activity
    C++ 12,268 1,901 675 454 Updated Dec 1, 2025
  • Megatron-LM Public

    Ongoing research training transformer models at scale

    NVIDIA/Megatron-LM’s past year of commit activity
    Python 14,363 3,328 332 225 Updated Dec 1, 2025
  • TensorRT-Incubator Public

    Experimental projects related to TensorRT

    NVIDIA/TensorRT-Incubator’s past year of commit activity
    MLIR 116 19 37 (1 issue needs help) 13 Updated Dec 1, 2025
  • numba-cuda Public

    The CUDA target for Numba

    NVIDIA/numba-cuda’s past year of commit activity
    Python 220 BSD-2-Clause 47 92 (1 issue needs help) 25 Updated Dec 1, 2025
  • OSMO Public

    The developer-first platform for scaling complex Physical AI workloads across heterogeneous compute—unifying training GPUs, simulation clusters, and edge devices in a simple YAML

    NVIDIA/OSMO’s past year of commit activity
    Python 40 Apache-2.0 1 7 5 Updated Dec 1, 2025