AI · 5 modules

Customizing LLMs: Fine-tuning, LoRA & RAG

Make a model your own — without guesswork. Learn when to fine-tune, when to reach for RAG, and how LoRA, distillation and good evaluation actually work, remembered with spaced repetition.

Plant your first seed See sample questions

practice cards: 74; practice cards
per day: ~10 min; per day
level: Intermediate; level
modules: 5; modules

About this topic

What does customizing a model mean?

Out of the box, a general-purpose LLM knows a lot but nothing about your domain, your data or your house style. Customizing a model is how you close that gap — and the first real skill is knowing which tool the job needs, because the options trade off cost, freshness and effort very differently.

Broadly there are two levers. Fine-tuning adjusts the model's own weights so it internalizes a behavior or format; RAG leaves the weights alone and feeds relevant facts in at query time. Around them sit the efficiency techniques — LoRA and PEFT to fine-tune cheaply, distillation and quantization to shrink and speed up — plus the data and evaluation work that decides whether any of it actually helped.

This track breaks customization into bite-sized, practical questions and uses spaced repetition so the trade-offs stick — so you reach for fine-tuning, RAG or a smaller quantized model on purpose, not by habit.

What you'll learn

5 modules, seed to bloom

Each module is a set of practice cards — 74 in total. Answer, review, and watch your knowledge grow from seed to full bloom.

Fine-tuning Fundamentals

Fine-tuning fundamentals — what it is, what it teaches, when it helps versus prompting or RAG, and its risks.

15 cards

LoRA & PEFT

LoRA and PEFT — parameter-efficient fine-tuning, why it is cheap, adapters, QLoRA, and rank trade-offs.

15 cards

RAG vs Fine-tuning

RAG versus fine-tuning — which to choose for facts, behavior, freshness, citation, and cost, and how to combine them.

14 cards

Distillation & Quantization

Distillation and quantization — shrinking and speeding up models, precision trade-offs, and how the techniques differ.

15 cards

Data & Evaluation

Data and evaluation — what data fine-tuning needs, validation sets, measuring improvement, and avoiding overfitting.

15 cards

Try before you plant

Sample questions

A taste of the real cards. Pick an answer, then reveal the explanation.

Sample · Customizing LLMs: Fine-tuning, LoRA & RAG

What is fine-tuning of an LLM?

AFurther training a pretrained model on custom data to adjust its weights
BWriting detailed prompts that steer a pretrained model at request time
CAttaching a document store so the model can look facts up at query time
DCompressing a pretrained model to lower precision for cheaper inference

Sample · Customizing LLMs: Fine-tuning, LoRA & RAG

What is PEFT (parameter-efficient fine-tuning)?

AFine-tuning only a small subset of parameters instead of all of them
BFine-tuning every weight in the model on a very small dataset
CRunning a model at lower precision to save memory at inference
DAdding retrieved documents to the prompt instead of training at all

Sample · Customizing LLMs: Fine-tuning, LoRA & RAG

What is the best approach to give a model up-to-date facts?

ARAG — retrieve current documents and add them to the prompt
BFine-tuning — bake the facts into the model's weights once
CDistillation — compress the facts into a smaller student model
DQuantization — lower the precision so more facts fit in memory

Sample · Customizing LLMs: Fine-tuning, LoRA & RAG

What is knowledge distillation?

ATraining a small student model to imitate a larger teacher model
BTraining a large model on the outputs of many smaller models
CRemoving low-importance weights to shrink an already-trained model
DLowering the precision of a model's weights to save on memory

How Gnoseed works

Learn it once, keep it for good

Answer a question

Each card is one practical concept with multiple options. Pick what you think is right.

Get the full answer

See the correct option plus a clear explanation, and a link to deeper docs when one is available.

Review at the right time

A spaced-repetition engine (SM-2 or FSRS) resurfaces each card just before you would forget it.

Why learn this

Why customization is worth your time

Pick the right tool

Fine-tune, RAG or prompt? Knowing the trade-offs saves weeks of effort spent on the wrong approach.

Fine-tune on a budget

LoRA, QLoRA and PEFT make adapting a model affordable — understanding them is the difference between feasible and not.

Ship smaller, faster models

Distillation and quantization cut size and latency; knowing the precision trade-offs keeps quality where you need it.

Know if it worked

Validation sets and honest evaluation are what separate a real improvement from a confident-looking regression.

FAQ

Common questions

Should I understand how LLMs work first? +

It helps a lot. This track assumes you know the basics of tokens, training and prompting — the AI & LLM Fundamentals and Prompt Engineering tracks pair naturally with it.

Fine-tuning or RAG — which should I learn? +

Both, because the real skill is choosing between them. A whole module is dedicated to RAG vs fine-tuning: facts and freshness usually favor RAG, behavior and format favor fine-tuning, and often you combine them.

Is it free? +

Yes, completely free. No registration or credit card is required, and all your progress is stored locally in your browser.

How long does it take? +

About 10 minutes a day. Spaced repetition means short, frequent sessions beat long cramming, so the trade-offs stick.

Ready to customize models with intent?

Plant your first seed today. Ten minutes a day is all it takes to know exactly when to fine-tune, when to RAG, and when to leave the model alone.

Start learning free

Customizing LLMs: Fine-tuning, LoRA & RAG

What does customizing a model mean?

5 modules, seed to bloom

Fine-tuning Fundamentals

LoRA & PEFT

RAG vs Fine-tuning

Distillation & Quantization

Data & Evaluation

Sample questions

Learn it once, keep it for good

Answer a question

Get the full answer

Review at the right time

Why customization is worth your time

Pick the right tool

Fine-tune on a budget

Ship smaller, faster models

Know if it worked

Common questions

Related topics

Ready to customize models with intent?