How is LLM API usage usually billed?

Question

Accepted Answer

Per token — you pay for the tokens in your input and in the model's output combined. It is not a flat per-request, per-minute, or per-user charge.

Answer

Per request — every call costs a flat fee no matter how long the call is

Answer

Per minute — you are billed for the total time the model spends running it

Answer

Per user — a fixed monthly charge covers unlimited calls for each account

Why this is the answer
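
To make per-token billing concrete, here is a minimal sketch of the arithmetic. It assumes input and output tokens are priced separately per million tokens, which is a common arrangement. The `TokenPrices` class, the `estimate_cost` function, and the rates themselves are hypothetical and chosen only for illustration; they are not any provider's real prices.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TokenPrices:
    """Hypothetical rates in USD per one million tokens (not real prices)."""
    input_per_million: float
    output_per_million: float


def estimate_cost(input_tokens: int, output_tokens: int, prices: TokenPrices) -> float:
    """Estimate the USD cost of one call under per-token billing."""
    input_cost = input_tokens * prices.input_per_million / 1_000_000
    output_cost = output_tokens * prices.output_per_million / 1_000_000
    # Both sides of the exchange count toward the bill.
    return input_cost + output_cost


# Assumed example rates: $2 per million input tokens, $8 per million output tokens.
prices = TokenPrices(input_per_million=2.0, output_per_million=8.0)

# Two calls of very different sizes, each a single request.
short_call = estimate_cost(input_tokens=100, output_tokens=50, prices=prices)
long_call = estimate_cost(input_tokens=10_000, output_tokens=2_000, prices=prices)

print(f"short call: ${short_call:.4f}")
print(f"long call:  ${long_call:.4f}")
```

Both calls are one request each, yet under these assumed rates the longer one costs far more. That is the difference from a flat per-request fee: the bill scales with how many tokens go in and come out.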
