GPU Pricing
Thousands of GPUs across 30+ regions.
Simple pricing plans for teams of all sizes,
designed to scale with you.
Serverless Pricing
Cost-effective for every inference workload.
Save 15% compared with other serverless cloud providers on flex workers alone.
GPU (memory): Flex price, Active price. Description.
H100 (80GB): Flex $0, Active $0. Extreme throughput for big models.
A100 (80GB): Flex $0, Active $0. High-throughput GPU, yet still very cost-effective.
L40, L40S, 6000 Ada (48GB): Flex $0, Active $0. Extreme inference throughput on LLMs like Llama 3 7B.
A6000, A40 (48GB): Flex $0, Active $0. A cost-effective option for running big models.
4090 (24GB): Flex $0, Active $0. Extreme throughput for small-to-medium models.
L4, A5000, 3090 (24GB): Flex $0, Active $0. Great for small-to-medium-sized inference workloads.
A4000, A4500, RTX 4000 (16GB): Flex $0, Active $0. The most cost-effective for small models.
Comparison
Find your perfect setup.
A simple, transparent breakdown of what’s
included in every plan.
Starter (Individuals): $0/mo + usage
Scale (Small teams): $250/mo + usage
Pro (Organizations): $500/mo + usage
Enterprise (Corporations): Custom
Workspace
Number of Seats
Starter: Up to 3
Scale: Up to 10
Pro: Unlimited
Enterprise: Unlimited
Compute & Performance
Included Compute Credit
Starter
Scale
Pro
Enterprise
$100/mo
$250/mo
Custom
GPU Concurrency Limit
Starter: 5 GPUs
Scale: 30 GPUs
Pro: 60 GPUs
Enterprise: Custom
Hardware Priority
Starter: Subject to availability
Scale: Prioritized over Starter
Pro: Prioritized over Scale
Enterprise: Reserve pools
Storage & Boot
Flash Boot
Starter
Scale
Pro
Enterprise
Passive
Passive
Priority
Network Storage
Starter: 1TB max
Scale: 12TB max
Pro: 50TB max
Enterprise: Unlimited
Support & SLA
Response Time
Starter: ~72 hours
Scale: ~48 hours
Pro: ~24 hours
Enterprise: Private Slack
Log Retention
Starter: 7 days
Scale: 30 days
Pro: ~24 hours
Enterprise: Custom
SLA
All plans (Starter, Scale, Pro, Enterprise): 99.9%
Security & Compliance
Datacenter Security
All plans (Starter, Scale, Pro, Enterprise): SOC2, HIPAA, ISO 27001, PCI DSS & GDPR
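As a rough illustration of how the comparison might be used, the sketch below picks the lowest plan whose seat limit and GPU concurrency limit cover a team's needs. It uses only the seat counts, concurrency limits, and base fees listed in the comparison. The names `Plan`, `PLANS`, and `smallest_fitting_plan` are hypothetical and not part of any official tool, and treating "Unlimited" and "Custom" as "no fixed cap" is an assumption.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Plan:
    name: str
    base_fee_usd: Optional[int]  # monthly base fee before usage; None = custom pricing
    max_seats: Optional[int]     # None = unlimited seats
    max_gpus: Optional[int]      # None = custom limit (assumed: no fixed cap)


# Values copied from the comparison above, ordered from lowest to highest tier.
PLANS = [
    Plan("Starter", 0, 3, 5),
    Plan("Scale", 250, 10, 30),
    Plan("Pro", 500, None, 60),
    Plan("Enterprise", None, None, None),
]


def smallest_fitting_plan(seats: int, concurrent_gpus: int) -> Plan:
    """Return the first (lowest) plan whose limits cover both requirements."""
    for plan in PLANS:
        seats_ok = plan.max_seats is None or seats <= plan.max_seats
        gpus_ok = plan.max_gpus is None or concurrent_gpus <= plan.max_gpus
        if seats_ok and gpus_ok:
            return plan
    # Not reached in practice: Enterprise has no fixed caps in this sketch.
    return PLANS[-1]


if __name__ == "__main__":
    plan = smallest_fitting_plan(seats=4, concurrent_gpus=12)
    print(plan.name)
```

The sketch ignores usage charges, hardware priority, storage, and support, so it is only a first filter, not a full cost comparison.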
Are you an early-stage startup or ML researcher?
Get up to $25K in free compute credits to use on on-demand GPUs and serverless endpoints.
