May 05, 2025: Snowflake Cortex Provisioned Throughput (General availability)
With Provisioned Throughput, a new capability in Snowflake Cortex, you can reserve throughput for managed inference.
Use Provisioned Throughput for the following tasks:
Reserve throughput for specific time periods using provisioned throughput units (PTUs).
Allocate capacity for supported models.
Scale throughput based on workload requirements with minimum and incremental configurations.
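As a rough illustration of sizing with minimum and incremental configurations, the sketch below rounds a desired number of PTUs up to the nearest reservable amount. It assumes that a reservation must meet a per-model minimum and then grow in fixed steps; that reading, the function name `round_up_ptus`, and the numbers used are hypothetical and are not taken from Snowflake's documentation or API.

```python
def round_up_ptus(required: int, minimum: int, increment: int) -> int:
    """Return the smallest reservable PTU count that covers `required`.

    Assumption (not from the Snowflake docs): valid reservations are
    minimum, minimum + increment, minimum + 2 * increment, and so on.
    """
    if minimum <= 0 or increment <= 0:
        raise ValueError("minimum and increment must be positive")
    if required <= minimum:
        return minimum
    # Ceiling division in integer arithmetic to avoid float rounding.
    extra_steps = -(-(required - minimum) // increment)
    return minimum + extra_steps * increment


# Hypothetical figures: a model with a minimum of 50 PTUs, sold in steps of 10.
print(round_up_ptus(required=75, minimum=50, increment=10))
```

Check the actual minimum and increment for each supported model before reserving; the values here are placeholders.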