## Summary

* If you want instant access to instances of 1x H100, use Lambda Labs or FluidStack.
* If you want instant access to instances of 1x-8x A100s, use Runpod.
* If you want access to a large H100 cluster, you'll need to talk to sales. You'll probably want to talk with FluidStack, Lambda, and Oracle.
* I'll personally be using a mix of FluidStack, Lambda, and Runpod.

## Best GPU clouds for H100 and A100 instant access

This table shows instant/on-demand availability and prices of H100s and A100s at the three clouds I believe are currently the best options for most people looking for instant access to reliable instances of fast GPUs: FluidStack, Runpod, and Lambda.

👉 **We made a tool for live price and availability updates of H100s and A100s: see [here](https://gpus.llm-utils.org/tracking-h100-and-a100-gpu-cloud-availability/).**

June 29th updates (prior update was June 24th):

* Lambda and FluidStack are both out of capacity on the A100 80GB SXM
* Lambda is now out of capacity on the A100 40GB SXM
* FluidStack is now out of capacity on the A100 40GB PCIe

| GPU            | Runpod (June 29th) | Lambda (June 29th) | FluidStack (June 29th) |
|----------------|--------------------|--------------------|------------------------|
| H100 80GB PCIe | No                 | ✅ $1.99           | ✅ $1.99               |
| A100 80GB SXM  | ✅ $1.84           | Out of capacity    | Out of capacity        |
| A100 80GB PCIe | ✅ $1.79           | No                 | ✅ $2.91               |
| A100 40GB PCIe | No                 | Out of capacity    | Out of capacity        |
| A100 40GB SXM  | No                 | Out of capacity    | No                     |

Prices are per GPU per hour. "No" means the cloud doesn't offer that GPU on demand; "Out of capacity" means it normally does, but has none available right now.

[See more details, links, and add comments on the google sheet here](https://docs.google.com/spreadsheets/d/1Wh2jLmfVMdDonCi1eFsL5dnLEQOBFKdPdlhYl4c5hPk/edit#gid=83934879).

Note that Runpod has instances of up to 6x H100s available instantly, though at higher prices and in the community cloud rather than the secure cloud.
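Since the table's rates are per GPU per hour, a quick back-of-the-envelope sketch helps translate them into monthly budgets. This is illustrative only (plain Python, with the June 29th rates from the table above hardcoded, assuming round-the-clock usage; rates change frequently, so check the live tracker before budgeting):

```python
# Back-of-the-envelope monthly costs at the June 29th on-demand rates above.
# Rates are per GPU per hour and change frequently -- check the live
# availability tracker before relying on these numbers.

HOURLY_RATES = {
    "H100 80GB PCIe (Lambda / FluidStack)": 1.99,
    "A100 80GB SXM (Runpod)": 1.84,
    "A100 80GB PCIe (Runpod)": 1.79,
}

HOURS_PER_MONTH = 24 * 30  # ~720 hours of round-the-clock usage

for gpu, rate in HOURLY_RATES.items():
    one_gpu = rate * HOURS_PER_MONTH
    print(f"{gpu}: ${one_gpu:,.0f}/mo for 1x, ${one_gpu * 8:,.0f}/mo for 8x")
```

At these rates, a single H100 PCIe comes to roughly $1,430/month if kept busy 24/7, and an 8x A100 80GB SXM node on Runpod to about $10,600/month.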
## Moved posts

The posts below also cover: which GPUs to use for which use cases, best GPU clouds by use case (Falcon-40B, MPT-30B, Stable Diffusion), H100/A100 alternatives and a GPU ranking, options for accessing large H100 clusters, and a long list of alternative GPU clouds.

* [Summary of best GPUs and clouds by use case](https://gpus.llm-utils.org/recommended-gpus-and-gpu-clouds-for-ai/)
* [Best GPUs and clouds for running Falcon-40B](https://gpus.llm-utils.org/gpus-for-running-falcon-40b/)
* [Best GPUs for running MPT-30B](https://gpus.llm-utils.org/gpus-for-running-mpt-30b/)
* [Best GPUs for running Stable Diffusion](https://gpus.llm-utils.org/gpus-for-running-stable-diffusion/)
* [Accessing very large numbers of H100s](https://gpus.llm-utils.org/accessing-large-h100-clusters/)
* [Alternative high-end GPUs and their recommended cloud providers](https://gpus.llm-utils.org/gpu-clouds-for-each-gpu/)
* [Even more options of GPU clouds for running Falcon-40B](https://gpus.llm-utils.org/gpu-clouds-for-falcon-40b/)
* [DGX GH200 vs GH200 vs H100](https://gpus.llm-utils.org/dgx-gh200-vs-gh200-vs-h100/)
* [RTX 6000 vs A6000 vs 6000 Ada](https://gpus.llm-utils.org/rtx-6000-vs-a6000-vs-6000-ada/)
* [VRAM and vCPUs](https://gpus.llm-utils.org/vram-and-vcpus/)
* [NVLink, SXM, PCIe](https://gpus.llm-utils.org/nvlink-sxm-and-pcie/)
* [H100 GPU cloud availability and pricing](https://gpus.llm-utils.org/h100-gpu-cloud-availability-and-pricing/)
* [A100 GPU cloud availability and pricing](https://gpus.llm-utils.org/a100-gpu-cloud-availability-and-pricing/)
* [Comparison of the user experience at each of the GPU cloud providers](https://gpus.llm-utils.org/gpu-cloud-user-experience-comparison/)
* [Giant list of additional GPU cloud options](https://gpus.llm-utils.org/alternative-gpu-clouds/)
* [DGX Cloud](https://gpus.llm-utils.org/dgx-cloud/)
* [Upcoming and recent updates](https://gpus.llm-utils.org/recent-update-log/)

### Other relevant resources

* Read my post on [[Building your own GPU cluster]] from my conversation with César, the CEO of FluidStack.
* Listen to my [[Conversation with Zhen Lu, Founder of Runpod.io]]
* Listen to my [[Conversation with Doruk, CEO at Oblivus]]
* If you're running batch processing jobs where you don't need the GPUs interlinked and you don't need high-RAM cards (e.g. Stable Diffusion, batch Whisper processing jobs), read our overview of [[Salad.com GPU cloud - company overview and conversation with Head of Product Daniel Sarfati]].

Meta:

* New posts: [Get notified about new posts via email](https://airtable.com/shr411VWRbl9og1xb).
* HN: [See the discussion on Hacker News](https://news.ycombinator.com/item?id=36333321).