⌂ Home

Jobs and Batch Processing

Interactive guide to run-to-completion workloads, retries, completions, and parallel batch execution in Kubernetes.

A Job is about successful completion, not continuous availability. That changes how Kubernetes measures success, handles retries, and cleans up Pods.

Core Model

Understand the Concept First

Repository YAML Files:

k8s/labs/workloads/jobs.yaml — Job manifest demonstrating run-to-completion work with Pod template and completion settings.

Run to completion

Jobs are the right workload for migrations, reports, calculations, and one-time administrative tasks.

Retry-aware

A failed Pod can be retried until backoffLimit is reached.

Parallel capable

Jobs can run one Pod or many Pods in parallel depending on the task.

Lifecycle Flow

Job Controller Lifecycle

Create Job

The Job declares a Pod template plus success and retry settings.

Launch Pods

The Job controller creates Pods to execute the batch task.

Track success

Completed Pods count toward the desired number of successful completions.

Retry failures

Failed runs may be retried depending on restartPolicy and backoffLimit.

Finish and retain state

The Job reaches a completed or failed state once its criteria are met.

Jobs are controller-driven just like Deployments, but the success metric is completions rather than continuously running replicas.

Visual Diagrams

Interactive Job Patterns

Simple Job Lifecycle

Parallel Job Execution (parallelism: 3)

Failure Handling with Backoff Limit

Click the buttons above to explore different Job execution patterns. Hover over diagram elements for tooltips.

YAML and Commands

Examples You Can Recognize Quickly

Basic Job

apiVersion: batch/v1
kind: Job
metadata:
  name: pi
spec:
  template:
    spec:
      containers:
      - name: pi
        image: perl
        command: ["perl", "-Mbignum=bpi", "-wle", "print bpi(2000)"]
      restartPolicy: Never
  backoffLimit: 4

Useful Commands

kubectl get jobs
kubectl describe job pi
kubectl get pods -l job-name=pi
kubectl logs -l job-name=pi

Decision Guide

Job vs Deployment

Feature	Job	Deployment
Purpose	Run to completion	Keep application running
Lifecycle	Ends when work finishes	Runs indefinitely
Success criteria	Completion count reached	Desired replicas remain available
Restart policy	Never or OnFailure	Controller keeps Pods replaced continuously

If the workload should stop when the work is done, it should usually be a Job rather than a Deployment.

Use It Well

Practice and Real-World Thinking

Database migrations

Run schema changes before or during rollout in a controlled way.

Batch reports

Generate output files, analytics, or long-running calculations without long-lived Pods.

Administrative automation

Backups, repair tasks, and one-time system operations fit naturally into Jobs.