AI · Flashcard

How is LLM API usage usually billed?

  • APer token — you pay for the tokens in your input and in the output combined
  • BPer request — every call costs a flat fee no matter how long the call is
  • CPer minute — you are billed for the total time the model spends running it
  • DPer user — a fixed monthly charge covers unlimited calls for each account

Why this is the answer

LLM APIs are billed per token, counting both your input and the model's output. It's not a flat per-request, per-minute, or per-user charge.

Official docs
Study in Gnoseed →