Cloud GPU

NVIDIA H100 & H200 Tensor Core GPUs

NVIDIA HGX H100 Tensor Core GPU
Based on the NVIDIA Hopper™ architecture, NVIDIA H100 features fourth-generation Tensor Cores and a Transformer Engine with FP8 precision that provides up to 4x faster training over the prior generation for GPT-3 (175B) models.

NVIDIA HGX H200 Tensor Core GPU
NVIDIA H200 is the first GPU to offer 141 gigabytes (GB) of HBM3e memory at 4.8 terabytes per second (TB/s) – that’s nearly double the capacity of the NVIDIA H100 Tensor Core GPU with 1.4X more memory bandwidth. The H200’s larger and faster memory accelerates generative AI and large language models, while advancing scientific computing for HPC workloads with better energy efficiency and lower total cost of ownership.
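
As a quick check on the comparison above, the ratios can be recomputed from the per-GPU figures quoted on this page (80 GB and 3.35 TB/s for the H100 SXM, 141 GB and 4.8 TB/s for the H200 SXM). This is only a sketch; the constant and function names are illustrative.

```python
# Per-GPU memory figures as quoted on this page (H100 SXM vs. H200 SXM).
H100_MEMORY_GB = 80
H200_MEMORY_GB = 141
H100_BANDWIDTH_TBPS = 3.35
H200_BANDWIDTH_TBPS = 4.8


def ratio(new: float, old: float) -> float:
    """Return how many times larger `new` is than `old`."""
    if old <= 0:
        raise ValueError("old must be positive")
    return new / old


capacity_ratio = ratio(H200_MEMORY_GB, H100_MEMORY_GB)
bandwidth_ratio = ratio(H200_BANDWIDTH_TBPS, H100_BANDWIDTH_TBPS)

print(f"Memory capacity:  {capacity_ratio:.2f}x the H100")
print(f"Memory bandwidth: {bandwidth_ratio:.2f}x the H100")
```

The capacity ratio comes out at roughly 1.76 and the bandwidth ratio at roughly 1.43, which matches the "nearly double" and "1.4X" wording.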

NVIDIA HGX H100 now available on Vultr

Unprecedented acceleration for the world’s most demanding AI and machine learning workloads, starting at $2.30 per hour

AI Training and AI Inference

8 x NVIDIA H100 80 GB SXM
2 x 480 GB NVMe
8 x 3.84 TB NVMe
2 x Intel Xeon Platinum 8480+
112 cores / 224 threads @ 2.0GHz
2048 GB Memory
15 TB Bandwidth
100 Gbps Network
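
For capacity planning, the per-node totals implied by the list above can be worked out directly. The sketch below only multiplies the listed quantities; the class and field names are hypothetical, and the storage total is raw capacity before any RAID or filesystem overhead.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class NodeSpec:
    """Quantities copied from the spec list above; names are illustrative."""
    gpus: int = 8
    gpu_memory_gb: int = 80
    small_drives: int = 2
    small_drive_gb: int = 480
    large_drives: int = 8
    large_drive_tb: float = 3.84
    cpu_sockets: int = 2
    cpu_cores_total: int = 112

    def total_gpu_memory_gb(self) -> int:
        # Aggregate HBM across all GPUs in the node.
        return self.gpus * self.gpu_memory_gb

    def total_nvme_tb(self) -> float:
        # Raw NVMe capacity in decimal terabytes (1 TB = 1000 GB).
        small_tb = self.small_drives * self.small_drive_gb / 1000
        large_tb = self.large_drives * self.large_drive_tb
        return small_tb + large_tb

    def cores_per_socket(self) -> int:
        return self.cpu_cores_total // self.cpu_sockets


node = NodeSpec()
print(f"GPU memory per node: {node.total_gpu_memory_gb()} GB")
print(f"Raw NVMe per node:   {node.total_nvme_tb():.2f} TB")
print(f"Cores per CPU:       {node.cores_per_socket()}")
```

With the listed figures this gives 640 GB of GPU memory, about 31.68 TB of raw NVMe, and 56 cores per CPU.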

Availability

Mini Cluster 64 H100 GPUs

Reserve Mini Cluster now

Base Cluster 248 H100 GPUs

Reserve Base Cluster now
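
To translate the cluster sizes above into server counts, one can divide by the GPUs per server. The sketch assumes each cluster is built from the 8-GPU HGX H100 configuration listed on this page, which the page does not state explicitly; the function name is hypothetical.

```python
import math

GPUS_PER_NODE = 8  # assumption: clusters use the 8 x H100 node listed above


def nodes_needed(total_gpus: int, gpus_per_node: int = GPUS_PER_NODE) -> int:
    """Return the number of nodes needed to host `total_gpus` GPUs."""
    if total_gpus <= 0 or gpus_per_node <= 0:
        raise ValueError("GPU counts must be positive")
    return math.ceil(total_gpus / gpus_per_node)


for name, gpus in [("Mini Cluster", 64), ("Base Cluster", 248)]:
    print(f"{name}: {gpus} GPUs -> {nodes_needed(gpus)} nodes")
```

Under that assumption, the Mini Cluster corresponds to 8 nodes and the Base Cluster to 31 nodes.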

Pricing

Starting at $2.30 per hour
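
A rough budget can be estimated from the starting rate. The page does not say whether $2.30 is billed per GPU or per instance, so the sketch below assumes per GPU per hour and ignores any discounts, reservations, or storage and bandwidth charges; the function name and default rate parameter are illustrative.

```python
from decimal import Decimal

# Assumption: the quoted starting rate applies per GPU per hour.
STARTING_RATE_USD = Decimal("2.30")


def estimate_cost_usd(gpus: int, hours: float,
                      rate: Decimal = STARTING_RATE_USD) -> Decimal:
    """Estimate cost as GPUs x hours x hourly rate, rounded to cents."""
    if gpus < 0 or hours < 0:
        raise ValueError("gpus and hours must be non-negative")
    total = rate * gpus * Decimal(str(hours))
    return total.quantize(Decimal("0.01"))


# One 8-GPU node for a day, and 64 GPUs for one hour.
print(f"8 GPUs x 24 h: ${estimate_cost_usd(8, 24)}")
print(f"64 GPUs x 1 h: ${estimate_cost_usd(64, 1)}")
```

Using `Decimal` rather than floats keeps the currency arithmetic exact to the cent. Actual pricing should be confirmed with Vultr before relying on these numbers.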

Key Features

NVIDIA Quantum-2 3200 Gb/s InfiniBand Networking
Non-Blocking InfiniBand Network Design
NVIDIA HGX H100 SXM with FP8 Support

Enterprise-ready at any scale and any location

Clusters at any size

Vultr's enterprise-ready infrastructure seamlessly supports any cluster size of NVIDIA H100 and H200 GPUs. Whether you require a small cluster or a massive deployment, Vultr ensures reliable, high-performance computing to meet your specific needs.

Get in touch to learn more

Globally available, locally accessible

Large clusters of NVIDIA H100 and H200 GPUs are available where you need them, thanks to Vultr’s extensive infrastructure. With 32 global cloud data center locations across six continents, we guarantee low latency and high availability, enabling your enterprise to achieve optimal performance worldwide.

Learn more about Vultr’s data center locations

Enterprise-grade compliance and security

Vultr ensures our platform, products, and services meet diverse global compliance, privacy, and security needs, covering areas such as server availability, data protection, and privacy. Our commitment to industry-wide privacy and security frameworks, including ISO and SOC 2 Type 2 standards, demonstrates our dedication to protecting our customers' data.

Learn more about Vultr’s security and compliance

Purpose-built for AI, simulation, and data analytics

AI, complex simulations, and massive datasets require multiple GPUs with extremely fast interconnections and a fully accelerated software stack. The NVIDIA HGX™ AI supercomputing platform brings together the full power of NVIDIA GPUs, NVLink®, NVIDIA networking, and fully optimized AI and high-performance computing (HPC) software stacks to provide the highest application performance and drive the fastest time to insights.

Download the H100 datasheet
Download the H200 datasheet

No form fill or personal details required for access.

The world’s most powerful GPU

NVIDIA H200 supercharges generative AI and high-performance computing (HPC) workloads with game-changing performance and memory capabilities. As the first GPU with HBM3e, the H200’s larger and faster memory fuels the acceleration of generative AI and large language models (LLMs) while advancing scientific computing for HPC workloads.

Llama2 70B inference: 1.9x faster
GPT-3 175B inference: 1.6x faster
High-performance computing: 110x faster

Specification	NVIDIA H100 SXM	NVIDIA H200 SXM¹
FP64	34 TFLOPS	34 TFLOPS
FP64 Tensor Core	67 TFLOPS	67 TFLOPS
FP32	67 TFLOPS	67 TFLOPS
TF32 Tensor Core	989 TFLOPS²	989 TFLOPS²
BFLOAT16 Tensor Core	1,979 TFLOPS²	1,979 TFLOPS²
FP16 Tensor Core	1,979 TFLOPS²	1,979 TFLOPS²
FP8 Tensor Core	3,958 TFLOPS²	3,958 TFLOPS²
INT8 Tensor Core	3,958 TOPS²	3,958 TOPS²
GPU Memory	80GB	141GB
GPU Memory Bandwidth	3.35TB/s	4.8TB/s
Decoders	7 NVDEC, 7 JPEG	7 NVDEC, 7 JPEG
Interconnect	NVIDIA NVLink®: 900GB/s; PCIe Gen5: 128GB/s	NVIDIA NVLink®: 900GB/s; PCIe Gen5: 128GB/s
¹Preliminary specifications. May be subject to change.
²With sparsity.
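
To see why the memory rows matter for large language models, a back-of-envelope sketch can estimate a model's weight footprint and a bandwidth-bound floor on per-token latency. This is a simplified rule of thumb, not a benchmark: it assumes every weight is read from GPU memory once per generated token at batch size 1, and it ignores KV cache, activations, compute time, and multi-GPU sharding. All function names are hypothetical.

```python
def weights_gb(params_billion: float, bytes_per_param: float) -> float:
    """Weight footprint in GB (1e9 params x bytes per param = GB)."""
    return params_billion * bytes_per_param


def fits(weights: float, gpu_memory_gb: float) -> bool:
    """True if the weights alone fit in one GPU's memory."""
    return weights <= gpu_memory_gb


def min_ms_per_token(weights: float, bandwidth_tbps: float) -> float:
    """Lower bound on decode latency if reading the weights dominates.

    GB divided by TB/s is numerically milliseconds.
    """
    return weights / bandwidth_tbps


# A 70B-parameter model at 2 bytes (FP16) and 1 byte (FP8) per parameter.
fp16 = weights_gb(70, 2)
fp8 = weights_gb(70, 1)

for name, mem_gb, bw in [("H100 SXM", 80, 3.35), ("H200 SXM", 141, 4.8)]:
    print(f"{name}: FP16 weights fit on one GPU: {fits(fp16, mem_gb)}")
    print(f"{name}: FP8 floor ~{min_ms_per_token(fp8, bw):.1f} ms/token")
```

Under these assumptions, the FP16 weights of a 70B model (140 GB) exceed a single H100's 80 GB but come in just under a single H200's 141 GB, leaving almost no room for anything else, so a real deployment would still need lower precision or more than one GPU.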
