Replaced by the lists at gpus.llm-utils.org.

Serverless can be a good fit if you only need low utilization and slow response times are acceptable. For example, you might need to generate lots of Stable Diffusion images at an unpredictable pace and volume, where you don't want a 4090 running 24/7 on a GPU cloud and it's fine to wait 1 minute for each image.
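To make the low-utilization argument concrete, here is a rough sketch of the break-even arithmetic between a dedicated GPU and serverless billing. All prices and workload figures below are hypothetical placeholders, not quotes from any provider, and the function names are made up for illustration. It assumes serverless bills per second of compute and a dedicated instance bills per hour whether or not it is busy.

```python
def breakeven_utilization(dedicated_per_hour: float, serverless_per_second: float) -> float:
    """Fraction of each hour the GPU must be busy before dedicated becomes cheaper.

    Below this fraction, paying per second of serverless compute costs less
    than keeping a dedicated instance running the whole hour.
    """
    serverless_per_hour = serverless_per_second * 3600
    return dedicated_per_hour / serverless_per_hour


def monthly_cost_serverless(images: int, seconds_per_image: float,
                            serverless_per_second: float) -> float:
    """Cost when you only pay for the seconds actually spent generating."""
    return images * seconds_per_image * serverless_per_second


def monthly_cost_dedicated(dedicated_per_hour: float, hours: float = 730) -> float:
    """Cost of leaving an instance on 24/7 (about 730 hours in a month)."""
    return dedicated_per_hour * hours


if __name__ == "__main__":
    # Hypothetical prices: substitute real numbers from your providers.
    dedicated = 1.00      # $/hour for a dedicated GPU (placeholder)
    serverless = 0.001    # $/second of serverless GPU time (placeholder)

    print("break-even utilization:", breakeven_utilization(dedicated, serverless))
    print("serverless, 10,000 images at 5 s each:",
          monthly_cost_serverless(10_000, 5, serverless))
    print("dedicated, always on:", monthly_cost_dedicated(dedicated))
```

If your expected utilization sits well below the break-even fraction and you can tolerate cold-start delays, serverless is likely the cheaper option; if it sits near or above it, a dedicated instance probably makes more sense.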