CORTEX_REST_API_RATE_LIMIT_POLICIES view
Query the CORTEX_REST_API_RATE_LIMIT_POLICIES view to see the per-model rate limit policies for Cortex REST API endpoints in your account. The view shows the requests-per-minute (RPM) and tokens-per-minute (TPM) limits for each model.
Columns

| Column Name | Data Type | Description |
|---|---|---|
| MODEL_NAME | TEXT | Name of the model for which the rate limit policy applies. |
| RPM | NUMBER | The requests-per-minute limit for the model. A NULL value means no RPM limit is enforced. |
| TPM | NUMBER | The tokens-per-minute limit for the model. A NULL value means no TPM limit is enforced. |
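
As a rough illustration of how the NULL semantics in these columns might be handled on the client side, the following Python sketch formats rows from the view into readable summaries. The `RateLimitPolicy` type, the `describe_policy` helper, and the sample rows are hypothetical and exist only for this example. The query string uses the unqualified view name, so it assumes the appropriate database and schema are already in context. Fetching the rows (for example, through a Snowflake connector) is left out.

```python
from typing import NamedTuple, Optional

# Unqualified view name: assumes the right database and schema are in context.
QUERY = "SELECT MODEL_NAME, RPM, TPM FROM CORTEX_REST_API_RATE_LIMIT_POLICIES"


class RateLimitPolicy(NamedTuple):
    """Hypothetical container for one row of the view."""
    model_name: str
    rpm: Optional[int]  # None corresponds to SQL NULL: no RPM limit enforced
    tpm: Optional[int]  # None corresponds to SQL NULL: no TPM limit enforced


def describe_policy(policy: RateLimitPolicy) -> str:
    """Return a one-line summary, treating NULL (None) as 'no limit'."""
    rpm = "no RPM limit" if policy.rpm is None else f"{policy.rpm} requests/min"
    tpm = "no TPM limit" if policy.tpm is None else f"{policy.tpm} tokens/min"
    return f"{policy.model_name}: {rpm}, {tpm}"


if __name__ == "__main__":
    # Made-up sample rows; real values come from running QUERY in your account.
    sample_rows = [
        ("example-model-a", 100, None),
        ("example-model-b", None, None),
    ]
    for row in sample_rows:
        print(describe_policy(RateLimitPolicy(*row)))
```

The helper checks for `None` explicitly rather than relying on truthiness, so NULL is never confused with a numeric value such as 0.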
Usage notes
Latency for the view may be up to 360 minutes (6 hours).
The view is scoped to the account that runs the query.