Time to first token (TTFT) captures how responsive a stream feels: it covers the queuing and prefill that happen before generation starts, and it is a separate measurement from total response time and per-token generation speed.
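
To make the distinction concrete, here is a minimal sketch of how the three numbers could be taken from a single streamed response. It assumes the stream is exposed as a lazy Python iterable that only issues the request once iteration begins. The names `measure_stream` and `fake_stream` are hypothetical and not part of any particular client library, and the `clock` parameter exists only so the timing can be swapped out for testing.

```python
import time
from typing import Callable, Iterable, Iterator, Optional, Tuple


def measure_stream(
    tokens: Iterable[str],
    clock: Callable[[], float] = time.perf_counter,
) -> Tuple[Optional[float], float, Optional[float], int]:
    """Return (ttft, total, per_token, count) for one streamed response.

    ttft      : seconds from start until the first token arrives (None if no tokens)
    total     : seconds from start until the last token arrives
    per_token : mean gap between tokens after the first (None if fewer than 2 tokens)
    count     : number of tokens received

    Assumption: `tokens` is lazy, so the wait for the first item includes
    whatever queuing and prefill the server performs.
    """
    start = clock()
    first: Optional[float] = None
    last = start
    count = 0
    for _ in tokens:
        now = clock()
        if first is None:
            first = now  # first token: queuing and prefill end here
        last = now
        count += 1

    if first is None:
        # Empty stream: there is no TTFT, only the time spent waiting.
        return None, clock() - start, None, 0

    ttft = first - start
    total = last - start
    per_token = (last - first) / (count - 1) if count > 1 else None
    return ttft, total, per_token, count


def fake_stream() -> Iterator[str]:
    """Hypothetical stand-in for a real stream: slow start, fast tokens."""
    time.sleep(0.05)  # simulated queuing + prefill
    for word in ["hello", "from", "the", "stream"]:
        yield word
        time.sleep(0.01)  # simulated per-token decode time


if __name__ == "__main__":
    ttft, total, per_token, count = measure_stream(fake_stream())
    print(f"TTFT={ttft:.3f}s total={total:.3f}s per-token={per_token:.3f}s n={count}")
```

In this sketch a stream can show a long TTFT with a short per-token gap, or the reverse, which is why the three values are reported separately rather than folded into one latency figure.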