- Schema:
CORTEX_ REST_ API_ RATE_ LIMIT_ POLICIES view¶
Query the CORTEX_REST_API_RATE_LIMIT_POLICIES view to see the per-model rate limit policies for Cortex REST API endpoints in your account. The view shows the requests-per-minute (RPM) and tokens-per-minute (TPM) limits for each model.
Columns¶
| Column Name | Data Type | Description |
|---|---|---|
| MODEL_NAME | TEXT | Name of the model for which the rate limit policy applies. |
| RPM | NUMBER | The requests-per-minute limit for the model. A NULL value means no RPM limit is enforced. |
| TPM | NUMBER | The tokens-per-minute limit for the model. A NULL value means no TPM limit is enforced. |
Usage notes¶
- Latency for the view may be up to 360 minutes (6 hours).
- The view is scoped to the account that runs the query.