LLM APIs are billed per token, counting both your input and the model's output. It's not a flat per-request, per-minute, or per-user charge.