The pillars beyond metrics, explained from first principles. Learn the three signals, structured logging and Loki, distributed tracing, OpenTelemetry and Grafana — and remember it with spaced repetition.
Observability is the ability to understand what is happening inside a system from the telemetry it emits — enough to debug problems you did not anticipate, not just the ones you set alerts for. It rests on three signals: metrics (aggregatable numbers over time), logs (discrete timestamped events), and traces (the causal path of one request across services). Each answers a different question, and the real skill is knowing which to reach for.
Metrics are the province of the Prometheus track; this track covers everything around them. It grounds the RED and USE methods, then goes deep on the other two pillars: structured logging and how Grafana Loki stays cheap by indexing labels instead of full text, and distributed tracing — spans, context propagation, and head- vs tail-based sampling with backends like Tempo and Jaeger.
It ties the picture together with OpenTelemetry — the vendor-neutral standard for producing telemetry, its Collector pipeline and OTLP — and Grafana, the layer where you visualize and alert on all of it. Spaced repetition keeps the concepts sharp for when you are on call and need them most.
Each module is a set of flashcards — 54 in total. Answer, review, and watch your knowledge grow from seed to full bloom.
metrics, logs and traces — what each signal is for
10 cardsstructured logging, Loki architecture and LogQL
11 cardsspans, context propagation, sampling, Tempo and Jaeger
11 cardsOTel API/SDK, the Collector, OTLP and instrumentation
11 cardsdashboards, panels, data sources, variables and alerting
11 cardsA taste of the real flashcards. Pick an answer, then reveal the explanation.
What are the three primary telemetry signals in observability?
What does Grafana Loki actually index for each log entry?
What is a span in distributed tracing?
What is OpenTelemetry?
Each card is one practical concept with multiple options. Pick what you think is right.
See the correct option plus a clear explanation, and a link to deeper docs when one is available.
A spaced-repetition engine (SM-2 or FSRS) resurfaces each card just before you would forget it.
Knowing when to reach for a metric, a log or a trace is what lets you diagnose novel failures instead of guessing.
OpenTelemetry and the Grafana stack are open standards — the skills transfer across employers and tools.
Understanding Loki labels and cardinality is how you keep logs queryable without an eye-watering bill.
The three pillars, tracing and OpenTelemetry are staple topics in SRE, DevOps and platform interviews.
The Prometheus track owns metrics — PromQL, metric types, exporters and Alertmanager. This track covers the other two pillars (logs and traces), OpenTelemetry, and Grafana, so the two are siblings that together complete the observability picture.
No. The track starts from what observability means and the three signals, so beginners and engineers formalizing what they already half-know both benefit.
Yes. Dedicated modules cover OpenTelemetry (the API/SDK, the Collector pipeline, OTLP and instrumentation) and Grafana (dashboards, data sources, variables and unified alerting), alongside logs/Loki and distributed tracing.
Yes, completely free. No registration or credit card is required, and all your progress is stored locally in your browser.
Plant your first seed today. Ten minutes a day is all it takes to turn telemetry into understanding.