Make a model your own — without guesswork. Learn when to fine-tune, when to reach for RAG, and how LoRA, distillation and good evaluation actually work, remembered with spaced repetition.
Out of the box, a general-purpose LLM knows a lot but nothing about your domain, your data or your house style. Customizing a model is how you close that gap — and the first real skill is knowing which tool the job needs, because the options trade off cost, freshness and effort very differently.
Broadly there are two levers. Fine-tuning adjusts the model's own weights so it internalizes a behavior or format; RAG leaves the weights alone and feeds relevant facts in at query time. Around them sit the efficiency techniques — LoRA and PEFT to fine-tune cheaply, distillation and quantization to shrink and speed up — plus the data and evaluation work that decides whether any of it actually helped.
This track breaks customization into bite-sized, practical questions and uses spaced repetition so the trade-offs stick — so you reach for fine-tuning, RAG or a smaller quantized model on purpose, not by habit.
Each module is a set of practice cards — 74 in total. Answer, review, and watch your knowledge grow from seed to full bloom.
Fine-tuning fundamentals — what it is, what it teaches, when it helps versus prompting or RAG, and its risks.
15 cardsLoRA and PEFT — parameter-efficient fine-tuning, why it is cheap, adapters, QLoRA, and rank trade-offs.
15 cardsRAG versus fine-tuning — which to choose for facts, behavior, freshness, citation, and cost, and how to combine them.
14 cardsDistillation and quantization — shrinking and speeding up models, precision trade-offs, and how the techniques differ.
15 cardsData and evaluation — what data fine-tuning needs, validation sets, measuring improvement, and avoiding overfitting.
15 cardsA taste of the real cards. Pick an answer, then reveal the explanation.
What is fine-tuning of an LLM?
What is PEFT (parameter-efficient fine-tuning)?
What is the best approach to give a model up-to-date facts?
What is knowledge distillation?
Each card is one practical concept with multiple options. Pick what you think is right.
See the correct option plus a clear explanation, and a link to deeper docs when one is available.
A spaced-repetition engine (SM-2 or FSRS) resurfaces each card just before you would forget it.
Fine-tune, RAG or prompt? Knowing the trade-offs saves weeks of effort spent on the wrong approach.
LoRA, QLoRA and PEFT make adapting a model affordable — understanding them is the difference between feasible and not.
Distillation and quantization cut size and latency; knowing the precision trade-offs keeps quality where you need it.
Validation sets and honest evaluation are what separate a real improvement from a confident-looking regression.
It helps a lot. This track assumes you know the basics of tokens, training and prompting — the AI & LLM Fundamentals and Prompt Engineering tracks pair naturally with it.
Both, because the real skill is choosing between them. A whole module is dedicated to RAG vs fine-tuning: facts and freshness usually favor RAG, behavior and format favor fine-tuning, and often you combine them.
Yes, completely free. No registration or credit card is required, and all your progress is stored locally in your browser.
About 10 minutes a day. Spaced repetition means short, frequent sessions beat long cramming, so the trade-offs stick.
Plant your first seed today. Ten minutes a day is all it takes to know exactly when to fine-tune, when to RAG, and when to leave the model alone.