Schema:

ACCOUNT_USAGE

CORTEX_REST_API_RATE_LIMIT_POLICIES view

Query the CORTEX_REST_API_RATE_LIMIT_POLICIES view to see the per-model rate limit policies for Cortex REST API endpoints in your account. The view shows the requests-per-minute (RPM) and tokens-per-minute (TPM) limits for each model.

Columns

Column Name

Data Type

Description

MODEL_NAME

TEXT

Name of the model for which the rate limit policy applies.

RPM

NUMBER

The requests-per-minute limit for the model. A NULL value means no RPM limit is enforced.

TPM

NUMBER

The tokens-per-minute limit for the model. A NULL value means no TPM limit is enforced.

Usage notes

  • Latency for the view may be up to 360 minutes (6 hours).

  • The view is scoped to the account that runs the query.