Resource Quotas and Limits

Interactive guide to requests, limits, namespace quotas, and how Kubernetes enforces resource fairness in shared clusters.

Requests, limits, and namespace ResourceQuota solve different but connected problems: scheduling guarantees, runtime protection, and namespace-level governance.

Core Model

Understand the Concept First

Requests

Requests declare the amount of CPU and memory the scheduler reserves for a container; a pod is only placed on a node with enough unreserved capacity to cover its requests.

Limits

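Limits cap how much CPU or memory the container may consume at runtime: CPU use above the limit is throttled, while exceeding the memory limit gets the container OOM-killed.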

ResourceQuota

ResourceQuota restricts aggregate consumption across a namespace by capping the summed requests and limits of the pods in it.

Lifecycle Flow

Admission and Enforcement Flow

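Quotas act first, at admission: the API server rejects a new pod that would push the namespace past its quota. Requests then drive scheduling, since the scheduler uses them to pick a node with enough free capacity. Limits apply last, at runtime, where they are enforced on the node while the container runs.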

YAML and Commands

Examples You Can Recognize Quickly

ResourceQuota Example

apiVersion: v1
kind: ResourceQuota
metadata:
  name: mem-cpu-demo
  namespace: quotaz
spec:
  hard:
    requests.cpu: "1"
    requests.memory: 1Gi
    limits.cpu: "2"
    limits.memory: 2Gi
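
With this quota in place, a pod that omits requests or limits for a tracked resource is rejected at admission, because the quota system cannot count it against the namespace totals. A minimal sketch of such a pod, assuming the quotaz namespace has no LimitRange supplying default values (the pod name no-resources-demo is hypothetical):

```yaml
apiVersion: v1
kind: Pod
metadata:
  name: no-resources-demo   # hypothetical name, for illustration only
  namespace: quotaz
spec:
  containers:
  - name: app
    image: nginx
    # No resources block. The quota above tracks CPU and memory requests
    # and limits, so this pod is expected to be rejected as Forbidden
    # unless a LimitRange in the namespace injects defaults.
```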

Pod with Resource Specs

apiVersion: v1
kind: Pod
metadata:
  name: resource-demo
spec:
  containers:
  - name: app
    image: nginx
    resources:
      requests:
        cpu: 500m        # Scheduler reserves 500m
        memory: 256Mi    # Scheduler reserves 256Mi
      limits:
        cpu: 1000m       # Throttled if exceeding
        memory: 512Mi    # OOMKilled if exceeding
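
To see how these values add up against the mem-cpu-demo quota, here is a worked sketch: a Deployment (hypothetical name quota-arithmetic-demo) that asks for three replicas of the same container in quotaz. Assuming nothing else is running in the namespace, the arithmetic in the comments suggests only two replicas are admitted, because the third would exceed both CPU ceilings.

```yaml
apiVersion: apps/v1
kind: Deployment
metadata:
  name: quota-arithmetic-demo   # hypothetical name, for illustration only
  namespace: quotaz
spec:
  replicas: 3
  selector:
    matchLabels:
      app: quota-arithmetic-demo
  template:
    metadata:
      labels:
        app: quota-arithmetic-demo
    spec:
      containers:
      - name: app
        image: nginx
        resources:
          requests:
            cpu: 500m      # 2 pods = 1000m, exactly the requests.cpu quota of "1"
            memory: 256Mi  # 3 pods = 768Mi, under the 1Gi quota
          limits:
            cpu: 1000m     # 2 pods = 2000m, exactly the limits.cpu quota of "2"
            memory: 512Mi  # 3 pods = 1536Mi, under the 2Gi quota
```

The Deployment object itself is accepted. The third pod is the one refused, and the rejection appears as an event on the ReplicaSet rather than as a Pending pod.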

Useful Commands

# Check namespace quota
kubectl get resourcequota -n quotaz
kubectl describe resourcequota mem-cpu-demo -n quotaz

# Check current usage
kubectl describe namespace quotaz

# View pod resource requests/limits
kubectl describe pod resource-demo

Decision Guide

Requests, Limits, and Quotas

Layer	Scope	Purpose
Request	Container	Guarantee minimum resources for scheduling
Limit	Container	Prevent a container from exceeding allowed resources
ResourceQuota	Namespace	Control total resource consumption across workloads

Most confusion disappears when you remember that requests are about placement, limits are about runtime, and quotas are about aggregate namespace policy.

Use It Well

Practice and Real-World Thinking

Multi-tenant governance

Keep one namespace or team from consuming the whole cluster.

Cost boundaries

Enforce budget-oriented ceilings in development or shared environments.

Stability protection

Prevent runaway workloads from destabilizing other workloads.