AI · Flashcard

Which parts of a request count toward token cost?

  • ABoth input and output — your prompt and the model's reply are each counted
  • BOnly the input — just your prompt is billed, and the reply is always free
  • COnly the output — the model's reply is billed, and your prompt is the free part
  • DNeither directly — billing is by the request count, not by any token totals

Why this is the answer

Both the input prompt and the output reply are counted toward token cost. Neither is free, and billing isn't by request count.

Official docs
Study in Gnoseed →