Skip to main content

Billing

Overview of Billing Components

Serverless Kubernetes is billed per-minute and based on the resources your pods use. Here are the key components of our billing model:

  • GPU: Each GPU used in your pods is billed at a fixed rate.
  • vCPU: Every vCPU used by your pods is billed. If your pods use CPU resources without a GPU, these are billed separately. CPU usage is billed at 1/100th the cost of a vCPU.
  • Memory: Memory usage is billed per MB. This allows for granular control over the memory allocation and cost.
  • Load Balancer: Running Load balancer on the cluster.

Billing for Suspended/Stopped/Idle Resources

Pods are only billed when they are actively running. If a pod is stopped, suspended, or idle, you will not be charged for those resources during that time.

When a cluster is Suspended, all pods are stopped and stop being billed for. The exception to this are load balancers, as the IP address is persisted during the time the cluster is suspended.

Resource Pricing

The following tables outlines the pricing per hour/per minute for:

1. GPU Resources:

ResourceSKUPrice (minute)Price (hour)
NVIDIA H100 SXM 80GBogx-k8s-h100sxm-800.08555.13
NVIDIA H100 80GBogx-k8s-h1000.0694084.16448
NVIDIA L4 24GBogx-k8s-l40.0190441.14261
NVIDIA L40S 48GBogx-k8s-l40s0.0360002.16000

2. Other Resources:

ResourceSKUPrice (minute)Price (hour)
Memory (MB)ogx-k8s-mem0.000000080.000005
CPU (1/100)ogx-k8s-vcpu0.000001670.0001
Load Balancerogx-k8s-lb0.00030150.01809

You can read more about billing in our Support section.