AI · Flashcard

What is a "rate limit" on an API?

  • AA cap on calls per time window — how many requests you may send per minute
  • BA cap on the reply length — the most tokens a single answer may contain
  • CA cap on the model size — the largest model your account is allowed to use
  • DA cap on accuracy spent — the share of correct answers granted to you per day

Why this is the answer

A rate limit caps how many requests (or tokens) you can send per time window. It's not a reply-length, model-size, or accuracy cap.

Official docs
Study in Gnoseed →