DevOps · 6 modules

Kubernetes Troubleshooting Scenarios

Practice Kubernetes the way you hit it in production — as scenarios. Diagnose real failures across architecture, workloads, networking, storage, configuration and security, and remember the fixes with spaced repetition.

practice cards
80
practice cards
per day
~10 min
per day
level
Intermediate → Advanced
level
modules
6
modules
About this topic

Learn Kubernetes by debugging it

Knowing the Kubernetes primitives is one thing; diagnosing why a rollout is throwing 503s, a PVC is stuck Pending, or a Service returns no endpoints is another. This track is built entirely from realistic scenarios — a symptom, some evidence, and the question of what is actually going wrong.

Each card drops you into a concrete production situation across cluster architecture, workloads, networking, storage, configuration and security, then walks through the root cause and the fix. It is the difference between knowing the docs and knowing what to do at 2 a.m.

Spaced repetition resurfaces these patterns so the next time you see the symptom, the cause comes to mind fast.

What you'll learn

6 modules, seed to bloom

Each module is a set of practice cards — 80 in total. Answer, review, and watch your knowledge grow from seed to full bloom.

Cluster Architecture (Advanced)

Scenario-based questions on control plane, nodes, and core components

15 cards

Workloads & Pods (Advanced)

Scenario-based questions on Pods, Deployments, StatefulSets, DaemonSets, Jobs

5 cards

Networking & Services (Advanced)

Scenario-based questions on Services, Ingress, DNS, and traffic routing

15 cards

Storage (Advanced)

Scenario-based questions on Volumes, PVs, PVCs, StorageClass, and CSI

15 cards

Configuration (Advanced)

Scenario-based questions on ConfigMaps, Secrets, resource management, and QoS

15 cards

Security (Advanced)

Scenario-based questions on RBAC, ServiceAccounts, NetworkPolicy, and Pod security

15 cards
Try before you plant

Sample questions

A taste of the real cards. Pick an answer, then reveal the explanation.

Sample · Kubernetes Troubleshooting Scenarios

A developer creates a Service targeting Pods with label 'app: backend', but the Service returns no endpoints. Running 'kubectl get endpoints my-service' shows an empty list. The Pods are running and healthy. What is the most likely cause?

  • AThe Service selector doesn't match the Pods' labels — either the label key/value is misspelled or the Pods use a different label (e.g., 'app: back-end' vs 'app: backend'). Label selectors require exact string matches with no fuzzy matching
  • BThe Service is in a different namespace than the Pods, and Services can only route to Pods in different namespaces if the Pods have the annotation 'service.kubernetes.io/cross-namespace: true'
  • CThe Pods are running but not yet in Ready state, and Services only include Pods that have been running for at least 30 seconds to avoid routing to containers that are still initializing
  • DThe kube-proxy hasn't synced the new Service yet — after creating a Service, there is a mandatory 5-minute propagation delay before endpoints are populated
Sample · Kubernetes Troubleshooting Scenarios

A developer creates a PersistentVolumeClaim requesting 10Gi of storage with StorageClass 'fast-ssd'. The PVC stays in Pending state. Running 'kubectl describe pvc' shows: 'waiting for first consumer to be created before binding'. What does this mean?

  • AThe StorageClass has volumeBindingMode set to WaitForFirstConsumer, which delays PV provisioning until a Pod actually references the PVC. This ensures the volume is created in the same availability zone as the Pod, preventing cross-zone scheduling failures
  • BThe cluster has run out of SSD storage capacity, and the PVC is waiting in a queue until storage is freed by other workloads being terminated
  • CThe PVC is waiting for an administrator to manually approve the storage request, because the 'fast-ssd' StorageClass requires manual provisioning approval for requests over 5Gi
  • DThe CSI driver for the 'fast-ssd' StorageClass has crashed and the PVC is retrying the provisioning request every 30 seconds until the driver recovers
How Gnoseed works

Learn it once, keep it for good

1

Answer a question

Each card is one practical concept with multiple options. Pick what you think is right.

2

Get the full answer

See the correct option plus a clear explanation, and a link to deeper docs when one is available.

3

Review at the right time

A spaced-repetition engine (SM-2 or FSRS) resurfaces each card just before you would forget it.

Why learn this

Why scenario practice works

Train the real skill

Production work is diagnosis, not recall. These cards practice the actual thing you get paid for.

Pattern-match faster

Seeing many failures means the next outage looks familiar instead of foreign.

Interview-ready

"Walk me through how you would debug this" is a staple of senior Kubernetes interviews.

FAQ

Common questions

Is this for beginners? +

It assumes you know the Kubernetes basics. If the primitives are new, start with the Kubernetes Fundamentals track first, then come here.

How long does it take? +

About 10 minutes a day. Scenario questions take a little longer to read, so the timing adapts — short, frequent sessions still win.

Is it free? +

Yes, completely free. No registration or credit card is required, and all your progress is stored locally in your browser.

Ready to debug Kubernetes with confidence?

Plant your first seed today. Ten minutes a day is all it takes to grow real production troubleshooting instincts.

Start learning free