@NVIDIA

NVIDIA Corporation

Pinned

  1. cuopt Public

    GPU accelerated decision optimization

    Cuda · 815 stars · 159 forks

  2. cuopt-examples Public

    NVIDIA cuOpt examples for decision optimization

    Jupyter Notebook · 434 stars · 75 forks

  3. open-gpu-kernel-modules Public

    NVIDIA Linux open GPU kernel module source

    C · 16.9k stars · 1.7k forks

  4. aistore Public

    AIStore: scalable storage for AI applications

    Go · 1.8k stars · 246 forks

  5. nvidia-container-toolkit Public

    Build and run containers leveraging NVIDIA GPUs

    Go · 4.3k stars · 509 forks

  6. GenerativeAIExamples Public

    Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.

    Jupyter Notebook · 3.9k stars · 1k forks

Repositories

Showing 10 of 710 repositories
  • mig-parted Public

    MIG Partition Editor for NVIDIA GPUs

    Go · 246 stars · Apache-2.0 · 60 forks · 9 open issues · 7 open pull requests · Updated Apr 14, 2026
  • Megatron-LM Public

    Ongoing research training transformer models at scale

    Python · 16,030 stars · 3,825 forks · 340 open issues (1 issue needs help) · 360 open pull requests · Updated Apr 14, 2026
  • aicr Public

    Tooling for optimized, validated, and reproducible GPU-accelerated AI runtime in Kubernetes

    Go · 264 stars · Apache-2.0 · 29 forks · 28 open issues (3 issues need help) · 12 open pull requests · Updated Apr 14, 2026
  • makani Public

    Massively parallel training of machine-learning based weather and climate models

    Python · 369 stars · 72 forks · 5 open issues · 4 open pull requests · Updated Apr 14, 2026
  • TensorRT-LLM Public

    TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in a performant way.

    Python · 13,356 stars · 2,282 forks · 575 open issues · 705 open pull requests · Updated Apr 14, 2026
  • Model-Optimizer Public

    A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM, TensorRT, vLLM, etc. to optimize inference speed.

    Python · 2,466 stars · Apache-2.0 · 353 forks · 61 open issues · 131 open pull requests · Updated Apr 14, 2026
  • terraform-provider-mcahr Public

    MCAHR terraform provider repo

    Go · 2 stars · Apache-2.0 · 2 forks · 0 open issues · 3 open pull requests · Updated Apr 14, 2026
  • terraform-provider-shoreline Public

    Shoreline terraform provider repo

    Go · 2 stars · Apache-2.0 · 14 forks · 0 open issues · 3 open pull requests · Updated Apr 14, 2026
  • NeMo-Retriever Public

    NeMo Retriever Library is a scalable, performance-oriented document content and metadata extraction microservice. NeMo Retriever extraction uses specialized NVIDIA NIM microservices to find, contextualize, and extract text, tables, charts and images that you can use in downstream generative applications.

    Python · 2,902 stars · Apache-2.0 · 315 forks · 126 open issues (1 issue needs help) · 84 open pull requests · Updated Apr 14, 2026
  • kvpress Public

    LLM KV cache compression made easy

    Python · 1,031 stars · Apache-2.0 · 130 forks · 5 open issues · 1 open pull request · Updated Apr 14, 2026