Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
3 changes: 3 additions & 0 deletions triton/ref/gpu.rst
Original file line number Diff line number Diff line change
Expand Up @@ -7,6 +7,7 @@
NVIDIA B300(*) | ``b300`` | ``288`` | 10.3 (``103``) | 16 | gpu[64-65] | 8 | 18944 | ``gpu-b300-288g-ellis``, ``gpu-b300-288g-short`` |
NVIDIA H200(*) | ``h200`` | ``141`` | 9.0 (``90``) | 112 | gpu[50-63] | 8 | 16896 | ``gpu-h200-141g-ellis``, ``gpu-h200-141g-short`` |
NVIDIA H200(**) | ``h200_2g.35gb`` | ``35`` | 9.0 (``90``) | 24 | gpu[49] | 24 | 4224 | ``gpu-h200-35g-ia-ellis``, ``gpu-h200-35g-ia`` |
NVIDIA :doc:`Grace-H200(+) </triton/usage/gracehopper>` | ``h200`` | ``141`` | 9.0 (``90``) | 4 | gpuarm[1-2] | 2 | 16896 | ``gpu-grace-h200-141g`` |
NVIDIA H100 | ``h100`` | ``80`` | 9.0 (``90``) | 16 | gpu[45-48] | 4 | 16896 | ``gpu-h100-80g`` |
NVIDIA A100 | ``a100`` | ``80`` | 8.0 (``80``) | 56 | gpu[11-17,38-44] | 4 | 7936 | ``gpu-a100-80g`` |
NVIDIA V100 | ``v100`` | ``32`` | 7.0 (``70``) | 40 | gpu[28-37] | 4 | 5120 | ``gpu-v100-32g`` |
Expand All @@ -27,3 +28,5 @@ preempted if a job requiring the resources comes in from the Ellis queue.

(**) These GPUs are split from a single GPU with NVIDIA's
`Multi-Instance GPU <https://docs.nvidia.com/datacenter/tesla/mig-user-guide/index.html>`__-feature.

(+) These computers have the Nvidia Grace CPU, an ARM based cpu. Normal software compiled for x86 does not run on these nodes. See :doc:`/triton/usage/gracehopper`.
1 change: 1 addition & 0 deletions triton/ref/hardware.rst
Original file line number Diff line number Diff line change
Expand Up @@ -21,3 +21,4 @@
gpu[49] | 1 | Dell PowerEdge XE9680 | 2025 | emerald avx2 h200 hopper | 2x32 core `Xeon® Platinum 8562Y+ <https://www.intel.com/content/www/us/en/products/sku/237558/intel-xeon-platinum-8562y-processor-60m-cache-2-80-ghz/specifications.html>`__ 2.8GHz | 2048GB DDR5-5600 | HDR | 8x `H200 SXM <https://www.nvidia.com/en-us/data-center/h200/>`__ each split to 3x35GB | 20 TB SSD
gpu[50-63] | 14 | Dell PowerEdge XE9680 | 2025 | emerald avx2 h200 hopper | 2x32 core `Xeon® Platinum 8562Y+ <https://www.intel.com/content/www/us/en/products/sku/237558/intel-xeon-platinum-8562y-processor-60m-cache-2-80-ghz/specifications.html>`__ 2.8GHz | 2048GB DDR5-5600 | HDR | 8x `H200 SXM <https://www.nvidia.com/en-us/data-center/h200/>`__ 141GB | 20 TB SSD
gpu[64-65] | 2 | Dell PowerEdge XE9780 | 2026 | granite avx2 b300 blackwell | 2x64 core `Xeon® 6767P <https://www.intel.com/content/www/us/en/products/sku/241845/intel-xeon-6767p-processor-336m-cache-2-40-ghz/specifications.html>`__ 2.4GHz | 3072GB DDR5-6400 | NDR | 8x `B300 SXM6 AC <https://www.nvidia.com/en-us/data-center/technologies/blackwell-architecture/>`__ 288GB | 28 TB SSD
gpuarm[1-2] | 2 | Supermicro ARS-221GL-NHIR | 2026 | NVIDIA :doc:`grace h200 hopper </triton/usage/gracehopper>` | 2x72 core `Grace A02 Neoverse-V2<https://www.nvidia.com/en-us/data-center/grace-cpu-superchip/>`__ 3.4GHz | 1318GB LPDDR5-6400 | NDR | 2x H200 `H200 SXM <https://www.nvidia.com/en-us/data-center/h200/>`__ 144GB HBM3e| 21 TB SSD
13 changes: 13 additions & 0 deletions triton/usage/gracehopper.rst
Original file line number Diff line number Diff line change
@@ -0,0 +1,13 @@
Grace Hopper Super Chips
========================


Hardware in Triton
------------------

Triton has currently two compute nodes (gpuarm[1,2]) with the Grace-Hopper superchip.
To gain access, please use the slurm partition: ``--partition gpu-grace-h200-141g``.

Arm vs x86
----------

Loading