AI cost management and governance
Snowflake gives you a consistent way to understand, monitor, and manage AI usage alongside the rest of your platform activity. Across AI features, pricing is primarily based on consumption, including token-based usage where applicable, so teams can align spend to actual usage instead of fixed capacity. To support cost transparency, Cortex AI provides usage views that help you analyze activity over time, break down consumption, and connect usage to billing workflows already used across your organization. These views can be used for reporting, governance, showback, and internal monitoring. For detailed pricing by feature, model, and unit of consumption, refer to the consumption table, which provides the current pricing structure across Snowflake AI capabilities.
Usage views
Snowflake provides usage views to help you track AI consumption using the same core approach used across the platform. These views support analysis of usage over time and can help teams understand how AI activity maps to overall spend, whether they are monitoring adoption, reviewing trends, or supporting internal reporting. This allows finance, platform, and engineering teams to work from a common system of record when evaluating usage. Pricing details remain available in the consumption table, which outlines how individual AI features are billed. Together, usage views and pricing documentation provide a foundation for understanding and managing AI costs across your Snowflake environment.
Usage views for total cost
These views should be used when calculating AI usage and AI-related spend. Together, they provide the standard foundation for cost reporting across AI features.
| Name | SERVICE_TYPE | Time Zone | Units | Dates |
|---|---|---|---|---|
| | CORTEX_AGENTS | UTC Converted to local [1] | Tokens, Tools | Data begins 11/10/2025 |
| | AI_SERVICES | UTC Converted to local [1] | Tokens | Data begins 1/5/2026 |
| | CORTEX_CODE_CLI | UTC | Tokens, Tools | Data begins 2/16/2026 |
| | CORTEX_CODE_SNOWSIGHT | UTC | Tokens, Tools | Data begins 3/13/2026, billing begins 4/1/2026 |
| | AI_SERVICES | UTC Converted to local [1] | Messages | 365 days of data |
| | AI_SERVICES | UTC Converted to local [1] | Fine-tuning time | 365 days of data |
| | AI_SERVICES | UTC | PTU Hours | 365 days of data |
| | AI_SERVICES | Local | Serving time, Tokens | 365 days of data |
| | SNOWFLAKE_INTELLIGENCE | UTC Converted to local [1] | Tokens, Tools | Data begins 11/10/2025 |
| | AI_INFERENCE | UTC | Tokens (note: in currency) | Data begins 11/1/2025 |

Coming Soon: Cortex AI Guardrails Account Usage View
[1] "UTC Converted to local" means the underlying data is stored in UTC. If your account is altered to use a local time zone, the view displays values in that local time.
[2] CORTEX_REST_API_USAGE_HISTORY is billed in dollars and is not currently shown in the account-level DAILY_METERING_HISTORY.
[3] CORTEX_SEARCH_DAILY_USAGE_HISTORY includes embeddings, which must be excluded from combined calculations because they are also shown in CORTEX_AI_FUNCTIONS_USAGE_HISTORY.
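As a rough illustration of footnote [3], the sketch below sums credits per view while skipping embedding rows from CORTEX_SEARCH_DAILY_USAGE_HISTORY, so they are not counted twice. The row shape and the field names `view`, `consumption_type`, and `credits`, along with the sample values, are assumptions made for this example and are not taken from the view definitions; check the actual view columns before adapting it.

```python
from collections import defaultdict

SEARCH_VIEW = "CORTEX_SEARCH_DAILY_USAGE_HISTORY"


def combined_credits(rows):
    """Sum credits per view, skipping Cortex Search embedding rows.

    Each row is a dict with hypothetical keys: "view", "consumption_type",
    and "credits". These names are illustrative assumptions.
    """
    totals = defaultdict(float)
    for row in rows:
        is_search = row["view"] == SEARCH_VIEW
        # Assumption: embedding rows carry a type label starting with "EMBED".
        is_embedding = row.get("consumption_type", "").upper().startswith("EMBED")
        if is_search and is_embedding:
            # Already reported in CORTEX_AI_FUNCTIONS_USAGE_HISTORY.
            continue
        totals[row["view"]] += row["credits"]
    return dict(totals)


# Made-up sample rows, for illustration only.
rows = [
    {"view": "CORTEX_AI_FUNCTIONS_USAGE_HISTORY",
     "consumption_type": "EMBED_TEXT_TOKENS", "credits": 2.0},
    {"view": SEARCH_VIEW, "consumption_type": "EMBED_TEXT_TOKENS", "credits": 2.0},
    {"view": SEARCH_VIEW, "consumption_type": "SERVING", "credits": 1.5},
]

totals = combined_credits(rows)
grand_total = sum(totals.values())
print(totals, grand_total)
```

Views billed in currency rather than credits, such as the one in footnote [2], would need to be tracked separately instead of being added to a credit total.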
Usage views for additional analysis
Use these views when you need more granular or feature-specific insight. They complement the primary views, but are not intended to serve as the standard source for AI cost totals.
| Name | Service Type | Time Zone | Dates | Notes |
|---|---|---|---|---|
| | AI_SERVICES | UTC Converted to local [1] | 365 days of data | This credit total includes the embedding costs captured in CORTEX_AI_FUNCTIONS_USAGE_HISTORY. |
| CORTEX_SEARCH_BATCH_QUERY_USAGE_HISTORY | AI_SERVICES | UTC Converted to local [1] | Data begins on 3/26/2026 | This credit total includes the embedding costs captured in CORTEX_AI_FUNCTIONS_USAGE_HISTORY. |
| | AI_SERVICES | | Data starts on 11/21/2025 | Slated for deprecation on 1/15/2027. This view includes totals of all functions except AI_EXTRACT. |
| | AI_SERVICES | | 365 days of data | Slated for deprecation. This view includes document processing now captured in CORTEX_AI_FUNCTIONS_USAGE_HISTORY. |
| | AI_SERVICES | | Data ends on 11/21/2025 | Slated for deprecation on 11/22/2026. Please use CORTEX_AI_FUNCTIONS_USAGE_HISTORY. |
| | AI_SERVICES | | Data ends on 11/21/2025 | Slated for deprecation on 11/22/2026. Please use CORTEX_AI_FUNCTIONS_USAGE_HISTORY. |

Total Cost of Operations
Tokens, messages, and other consumption units are not the only charges for Cortex AI. You are also billed for the query, warehouse time, and any other associated Snowflake charges. Using query_id, warehouse_id, and user_id, you should be able to calculate your total cost of operation. For more details, see the associated usage view or contact support.
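One way to picture this calculation is the minimal sketch below, which joins AI usage rows to per-query warehouse cost on query_id and totals both per user. It assumes a per-query warehouse credit figure is already available. The field names `ai_credits` and `warehouse_credits` and all sample values are hypothetical, chosen only to illustrate the join.

```python
from collections import defaultdict


def total_cost_by_user(ai_rows, query_rows):
    """Combine AI credits and warehouse credits per user, joined on query_id.

    ai_rows:    dicts with query_id, user_id, ai_credits (hypothetical field).
    query_rows: dicts with query_id, warehouse_id, warehouse_credits
                (hypothetical field holding the query's warehouse cost).
    """
    query_by_id = {q["query_id"]: q for q in query_rows}
    totals = defaultdict(lambda: {"ai_credits": 0.0, "warehouse_credits": 0.0})
    counted_queries = set()

    for row in ai_rows:
        user = row["user_id"]
        totals[user]["ai_credits"] += row["ai_credits"]
        query = query_by_id.get(row["query_id"])
        # Count each query's warehouse cost once, even when the query
        # produced several AI usage rows.
        if query is not None and row["query_id"] not in counted_queries:
            totals[user]["warehouse_credits"] += query["warehouse_credits"]
            counted_queries.add(row["query_id"])

    return {
        user: {**t, "total": t["ai_credits"] + t["warehouse_credits"]}
        for user, t in totals.items()
    }


# Made-up sample data, for illustration only.
ai_rows = [
    {"query_id": "q1", "user_id": "u1", "ai_credits": 0.5},
    {"query_id": "q1", "user_id": "u1", "ai_credits": 0.25},
    {"query_id": "q2", "user_id": "u2", "ai_credits": 1.0},
]
query_rows = [
    {"query_id": "q1", "warehouse_id": "wh1", "warehouse_credits": 0.125},
    {"query_id": "q2", "warehouse_id": "wh1", "warehouse_credits": 0.5},
]

print(total_cost_by_user(ai_rows, query_rows))
```

The same join could be grouped by warehouse_id instead of user_id when the goal is showback per warehouse rather than per person.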
Budget features
Snowflake budgets help organizations monitor credit usage and respond when spending approaches or exceeds configured thresholds. These features can support internal planning, alerting, and broader governance processes for AI usage as part of an overall cost management strategy. A budget defines a monthly spending limit for an account or for a custom group of Snowflake objects. Budgets can send notifications when spend is projected to exceed the configured limit, and Snowflake also supports custom actions for budgets based on either projected or actual consumption. This allows teams to pair spend monitoring with operational responses, using the same core budgeting model across Snowflake cost management workflows.
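To make the idea of projected spend concrete, here is a minimal sketch that extrapolates month-to-date spend at a constant daily rate and compares it with a monthly limit. The linear run-rate formula is an assumption made for illustration; the text above does not describe how Snowflake computes its projection, and the function names are hypothetical.

```python
import calendar
from datetime import date


def projected_month_end_spend(spend_to_date, today):
    """Extrapolate month-to-date spend to month end at a constant daily rate.

    This is a simple run-rate estimate, not Snowflake's forecasting method.
    """
    days_in_month = calendar.monthrange(today.year, today.month)[1]
    return spend_to_date / today.day * days_in_month


def is_projected_over(spend_to_date, today, monthly_limit):
    """Return True when the run-rate projection exceeds the monthly limit."""
    return projected_month_end_spend(spend_to_date, today) > monthly_limit


# Example: 100 credits spent by April 10 of a 30-day month.
projection = projected_month_end_spend(100.0, date(2026, 4, 10))
print(projection, is_projected_over(100.0, date(2026, 4, 10), 250.0))
```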
Resource budgets for AI features
Resource budgets let administrators define a monthly credit limit for a tagged Cortex Agent object and evaluate spend against that budget on a periodic basis. Because they use Snowflake’s tag-based cost attribution model, they fit into broader governance and budget management patterns already used across the platform. Snowflake also announced resource budgets for Snowflake Intelligence on the same date, extending this model across additional AI experiences.
Budget capability by feature

| Feature | Budget capabilities |
|---|---|
| Cortex Agents | |
| Cortex AI Functions | |
| Cortex Code CLI (Consumption) | |
| Cortex Code in Snowsight | |
| Snowflake Intelligence | |
| Cortex Search | Planned resource budgets for the coming year |

Neither supported nor planned: Cortex Analyst, Cortex Fine-tuning.
Budget timing, enforcement, and automated actions
For Resource Budgets and Shared Resource Budgets, you can attach stored procedures that run when spending reaches specific thresholds. Thresholds are expressed as a percentage of the spending limit and apply to the monthly budget period. Budget evaluation and enforcement are calculated periodically rather than instantaneously: after a budget threshold is exceeded, actions can take up to eight hours to take effect under normal operation, or up to two hours when using the latency-optimized option. Budgets are useful for ongoing spend management and policy enforcement, while still being part of a broader cost governance strategy that may also include usage monitoring and internal operational review.
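The threshold arithmetic described above can be sketched as follows. This only models the percentage comparison and the stated worst-case delays; it does not call any Snowflake API, and the function names are hypothetical.

```python
def crossed_thresholds(spend, monthly_limit, thresholds_pct):
    """Return the thresholds (percent of the limit) that spend has reached."""
    if monthly_limit <= 0:
        raise ValueError("monthly_limit must be positive")
    return [
        pct for pct in sorted(thresholds_pct)
        if spend >= monthly_limit * pct / 100
    ]


def worst_case_action_delay_hours(latency_optimized):
    """Upper bound on action delay, using the figures quoted in the text."""
    return 2 if latency_optimized else 8


# Example: 85 credits spent against a 100-credit monthly limit.
print(crossed_thresholds(85.0, 100.0, [100, 50, 80]))
print(worst_case_action_delay_hours(latency_optimized=True))
```

Because evaluation is periodic, spend can continue to accrue between the moment a threshold is crossed and the moment its action runs, so a threshold set below 100 percent leaves headroom for that delay.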