Billing

Overview of Billing Components

Serverless Kubernetes is billed per-minute and based on the resources your pods use. Here are the key components of our billing model:

GPU: Each GPU used in your pods is billed at a fixed rate.
vCPU: Every vCPU used by your pods is billed. If your pods use CPU resources without a GPU, these are billed separately. CPU usage is billed at 1/100th the cost of a vCPU.
Memory: Memory usage is billed per MB. This allows for granular control over the memory allocation and cost.
Load Balancer: Running Load balancer on the cluster.

Billing for Suspended/Stopped/Idle Resources

Pods are only billed when they are actively running. If a pod is stopped, suspended, or idle, you will not be charged for those resources during that time.

When a cluster is Suspended, all pods are stopped and stop being billed for. The exception to this are load balancers, as the IP address is persisted during the time the cluster is suspended.

Resource Pricing

The following tables outlines the pricing per hour/per minute for:

1. GPU Resources:

Resource	SKU	Price ($/minute)	Price ($/hour)
NVIDIA H200SXM 141GB	ogx-k8s-h200sxm-141	0.07875	4.725
NVIDIA H100SXM 80GB	ogx-k8s-h100sxm-80	0.06525	3.92
NVIDIA H100 80GB	ogx-k8s-h100	0.06525	3.92
NVIDIA L40S 48GB	ogx-k8s-l40s	0.03488	2.09
NVIDIA L4 24GB	ogx-k8s-l4	0.019044	1.14

2. Other Resources:

Resource	SKU	Price ($/minute)	Price ($/hour)
Memory (MB)	ogx-k8s-mem	0.00000008	0.000005
CPU (1/100)	ogx-k8s-vcpu	0.00000167	0.0001
Load Balancer	ogx-k8s-lb	0.0003015	0.01809

You can read more about billing in our Support section.

Billing

Overview of Billing Components​

Billing for Suspended/Stopped/Idle Resources​

Resource Pricing​

Contents

Overview of Billing Components

Billing for Suspended/Stopped/Idle Resources

Resource Pricing