Cortex Analyst¶

Get started with Cortex Analyst

Overview¶

Cortex Analyst is a fully-managed, LLM-powered Snowflake Cortex feature that helps you create applications capable of reliably answering business questions based on your structured data in Snowflake. With Cortex Analyst, business users can ask questions in natural language and receive direct answers without writing SQL. Available as a convenient REST API, Cortex Analyst can be seamlessly integrated into any application.

Building a production-grade conversational self-service analytics solution requires a service that generates accurate text-to-SQL responses. For most teams, developing such a service that successfully balances accuracy, latency, and costs is a daunting task. Cortex Analyst simplifies this process by providing a fully managed, sophisticated agentic AI system that handles all of these complexities, generating highly accurate text-to-SQL responses. It helps you accelerate the delivery of high-precision, self-serve conversational analytics to business teams, while avoiding time sinks such as complex RAG solution patterns, model experimentation, and GPU capacity planning. The generated SQL queries are executed against the scalable Snowflake engine, ensuring industry-leading price performance and lower total cost of ownership (TCO).

Tip

Want to get started with Cortex Analyst quickly? Try the Tutorial: Answer questions about time-series revenue data with Cortex Analyst tutorial.

Key features¶

Self-serve analytics via natural language queries. Delight your business teams and non-technical users with instant answers and insights from their structured data in Snowflake. Using Cortex Analyst, you can build downstream chat applications that allow your users to ask questions using natural language and receive accurate answers on the fly.
Convenient REST API for integration into existing business workflows. Cortex Analyst takes an API-first approach, giving you full control over the end user experience. Easily integrate Cortex Analyst into existing business tools and platforms, bringing the power of data insights to where business users already operate, such as Streamlit apps, Slack, Teams, custom chat interfaces, and more.
Powered by state-of-the-art large language models: By default, Cortex Analyst is powered by industry-leading models which run securely inside Snowflake Cortex, Snowflake’s intelligent, fully managed AI service. At runtime, Cortex Analyst selects the best combination of models to ensure the highest accuracy and performance for each query. As LLMs evolve, Snowflake may add more models to the mix to further improve performance and accuracy.
Semantic models for high precision and accuracy: Generic AI solutions often struggle with text-to-SQL conversions when given only a database schema, as schemas lack critical knowledge like business process definitions and metrics handling. Cortex Analyst overcomes this limitation by using a semantic model to bridge the gap between business users and databases. Captured in a lightweight YAML file, the overall structure and concepts of the semantic model are similar to those of database schemas, but allow for a richer description of the semantic information around the data.

If you set up Cortex Analyst to answer questions from a large number of data sources, Cortex Analyst can automatically figure out which one to use. You don’t have to worry about specifying the right one with each query.
Security and governance. Snowflake’s privacy-first foundation and enterprise-grade security ensure that you can explore AI-driven use cases with confidence, knowing your data is protected by the highest standards of privacy and governance.
- Cortex Analyst does not train on Customer Data. We do not use your Customer Data to train or fine-tune any Model to be made available for use across our customer base. Additionally, for inference, Cortex Analyst uses the metadata provided in the semantic model YAML file (e.g., table names, column names, value type, descriptions, etc.) only for SQL-query generation. This SQL query is then executed in your Snowflake virtual warehouse to generate the final output.
- Data stays within Snowflake’s governance boundary. By default, Cortex Analyst is powered by Snowflake-hosted LLMs from Mistral and Meta, ensuring that no data, including metadata or prompts, leaves Snowflake’s governance boundary.
- Seamless integration with Snowflake’s Privacy and Governance features. Cortex Analyst fully integrates with Snowflake’s role-based access control (RBAC) policies, ensuring that SQL queries generated and executed adhere to all established access controls. This guarantees robust security and governance for your data.

Access control requirements¶

To make requests to Cortex Analyst, use a role with either the SNOWFLAKE.CORTEX_USER or SNOWFLAKE.CORTEX_ANALYST_USER database role. CORTEX_USER provides access to all Covered AI features, while CORTEX_ANALYST_USER provides access only to Cortex Analyst. For information about Covered AI features, see Legal notices.

To use Cortex Analyst with a semantic model, you also need the following privileges:

Privilege	Object
READ or WRITE	Stage that contains the semantic model YAML file, if the semantic model is uploaded to a stage.
USAGE	The Cortex Search services mentioned in the semantic model.
SELECT	The tables mentioned in the semantic model.

Requests to the Cortex Analyst API must include an authorization token. For details on how to authenticate to the API, see Authenticating Snowflake REST APIs with Snowflake.

Note that the example in this topic uses a session token to authenticate to a Snowflake account.

Limiting access to specific roles¶

By default, the CORTEX_USER role is granted to the PUBLIC role. The PUBLIC role is automatically granted to all users and roles. If you don’t want all users to have this privilege, you can revoke access to the PUBLIC role and grant access to specific roles. For more information, see Cortex LLM privileges.

To control access to specific semantic models, you can store the YAML file in a stage and control access to that stage.

Limiting access using the Cortex Analyst user role¶

To provide selective access to Cortex Analyst for specific users, use the SNOWFLAKE.CORTEX_ANALYST_USER database role. This role includes the privileges needed to call the Cortex Analyst API. For more information about Covered AI features, see Legal notices.

Important

If your user roles have the CORTEX_USER role, you must revoke access to the CORTEX_USER role. To revoke the CORTEX_USER database role from your user roles, run the following command using the ACCOUNTADMIN role:

REVOKE DATABASE ROLE SNOWFLAKE.CORTEX_USER FROM ROLE analyst;

Copy

To provide access to Cortex Analyst, use the ACCOUNTADMIN role to do the following:

Grant the SNOWFLAKE.CORTEX_ANALYST_USER database role to a custom role.
Assign this custom role to users.

Note

You can’t grant database roles directly to users. For more information, see GRANT DATABASE ROLE.

The following example:

Creates the custom role, cortex_analyst_user_role.
Grants it the CORTEX_ANALYST_USER database role.
Assigns this role to example_user.

USE ROLE ACCOUNTADMIN;
CREATE ROLE cortex_user_role;
GRANT DATABASE ROLE SNOWFLAKE.CORTEX_ANALYST_USER TO ROLE cortex_analyst_user_role;

GRANT ROLE cortex_analyst_user_role TO USER example_user;

Copy

You can also grant access to Cortex Analyst through existing roles. For example, if you have an analyst role used by analysts in your organization, you can grant access with a single GRANT statement:

GRANT DATABASE ROLE SNOWFLAKE.CORTEX_ANALYST_USER TO ROLE analyst;

Copy

Region availability¶

Cortex Analyst is natively available in the following regions.

AWS ap-northeast-1 (Tokyo)
AWS ap-southeast-2 (Sydney)
AWS us-east-1 (Virginia)
AWS US East (Commercial Gov - N. Virginia)
AWS us-west-2 (Oregon)
AWS eu-central-1 (Frankfurt)
AWS eu-west-1 (Ireland)
Azure East US 2 (Virginia)
Azure West Europe (Netherlands)

If your Snowflake account is in a different cloud region, you can still use Cortex Analyst by leveraging Cross-region inference. Once cross-region inference is enabled, Cortex Analyst processes requests in other regions for models that are not available in your default region. For optimal performance, configure cross-region with AWS US regions.

Multi-turn conversation in Cortex Analyst¶

Cortex Analyst supports multi-turn conversations for data-related questions. This feature enables asking follow-up questions that build on previous queries, creating a more dynamic and interactive data exploration experience. For example, the user asks, “What is the month-over-month revenue growth for 2021 in Asia?”, then follows up with, “What about North America?”

Cortex Analyst recognizes the follow-up, retrieves the context from the initial query, and rephrases the second question as: “What is the month-over-month revenue growth for 2021 in North America?” Cortex Analyst then generates a SQL query to answer this question.

To use this feature, pass the conversation history in the messages field:

{
    "messages": [
        {
            "role": "user",
            "content": [
                {
                    "type": "text",
                    "text": "What is the month over month revenue growth for 2021 in Asia?"
                }
            ]
        },
        {
            "role": "analyst",
            "content": [
                {
                    "type": "text",
                    "text": "We interpreted your question as ..."
                },
                {
                    "type": "sql",
                    "statement": "SELECT * FROM table"
                }
            ]
        },
        {
            "role": "user",
            "content": [
                {
                    "type": "text",
                    "text": "What about North America?"
                }
            ]
        },
    ],
    "semantic_model_file": "@my_stage/my_semantic_model.yaml"
}

Copy

The conversation history is an array of messages in chronological order, where each message has a role and content. The role can be "user" (for previous questions) or "analyst" (for previous responses). Analyst responses have both text and SQL responses, as shown in the example above, while user messages have only text.

Important

Large language models like the ones used by Cortex Analyst do not store state between requests. The full history is processed for each new query in a conversation, with corresponding compute cost that increases with each round.

Known limitations in multi-turn conversations¶

Some of the following limitations might be addressed in future versions of Cortex Analyst.

Access to the results of previous SQL queries: Cortex Analyst doesn’t have access to results from previous SQL queries. For example, if you first ask, “What are my products?” and then ask, “What is the revenue of the second product?”, Cortex Analyst cannot refer to the list of products from the first query to get the second product.
General business insights: Cortex Analyst is limited to answering questions that can be resolved with SQL. It does not generate insights for broader business-related queries, such as “What trends do you observe?”
Long conversations: If a conversation includes too many turns or the user shifts intent frequently, Cortex Analyst might struggle to interpret the follow-up questions. In such cases, reset the conversation and start again.

Getting started¶

Developers can use the following resources to get started with Cortex Analyst:

Basic code example: The Cortex Analyst example in the following section provides a simple, easy-to-read script that helps you create an interactive app using Cortex Analyst.

Choose this option if you want a basic fundamental example to start with, and are comfortable with using Streamlit and making your own modifications. You can run this example either in Streamlit in Snowflake (SiS) or locally.
Snowflake Samples repository: If you’re instead looking for a more comprehensive implementation, the Cortex Analyst advanced SiS demo in the Snowflake Samples repository has all the features and options already set up. This repository is configured with various pre-built features that make deploying Cortex Analyst seamless and robust.

Choose this option if you are trying to test out the feature for the first time, or have fewer custom modifications to make.

Note

This is shown only as an example. Snowflake does not provide support for the below content, nor does Snowflake warrant that the below content is accurate.

To learn more, see the Cortex Analyst advanced SiS demo in the Snowflake Samples GitHub repository.

Cortex Analyst example¶

Follow these steps to create an interactive Streamlit in Snowflake (SiS) or standalone Streamlit app that uses Cortex Analyst.

Create a semantic model
Upload the semantic model to stage
Create and run a Streamlit in Snowflake app
Interact with the Streamlit in Snowflake app

Create a semantic model¶

A semantic model is a lightweight mechanism that addresses issues related to the language difference between business users and database definitions by allowing for the specification of additional semantic details about a dataset. These additional semantic details, like more descriptive names or synonyms, enable Cortex Analyst to answer data questions much more reliably.

Start with a list of questions you would like Cortex Analyst to answer. Based on that, decide on the dataset for your semantic model.
Create your semantic model YAML based on the specification. For convenience, try the Create a semantic view using the AI-assisted model generator.

Upload semantic model¶

You can upload a semantic model YAML file to a stage or pass the semantic model YAML as a string in the request body. If you upload a semantic model YAML to a stage, access to that semantic model is controlled by access to the stage it’s uploaded to. This means that any role with access to the stage can access the semantic models on that stage even if the role doesn’t have access to the tables that the models are based on. Ensure that roles granted access to a stage have SELECT access on all tables referenced in all semantic models on that stage.

Below is an example of how to set up the stages containing the semantic models. One stage (public) is accessible to all members of the organization, whereas another stage (sales) is only accessible to the sales_analyst role.

Create the database and schema for the stage. The following example creates a database named semantic_model with a schema named definition but you can use any valid identifier string for these names.

CREATE DATABASE semantic_model;
CREATE SCHEMA semantic_model.definitions;
GRANT USAGE ON DATABASE semantic_model TO ROLE PUBLIC;
GRANT USAGE ON SCHEMA semantic_model.definitions TO ROLE PUBLIC;

USE SCHEMA semantic_model.definitions;

Copy

Then create the stages for storing your semantic models:

CREATE STAGE public DIRECTORY = (ENABLE = TRUE);
GRANT READ ON STAGE public TO ROLE PUBLIC;

CREATE STAGE sales DIRECTORY = (ENABLE = TRUE);
GRANT READ ON STAGE sales TO ROLE sales_analyst;

Copy

In Snowsight, you can refresh the page and find the newly created stages in the database object explorer. You can open the stage page in a new tab and upload your YAML files in Snowsight.

Alternatively, you can use the Snowflake CLI client to upload from your local file system.

snow stage copy file:///path/to/local/file.yaml @sales

Copy

Creating a Streamlit in Snowflake App¶

This example shows you how to create a Streamlit in Snowflake app that takes a natural language question as input and calls Cortex Analyst to generate an answer based on the semantic model you provide.

Note

This is shown only as an example. Snowflake does not provide support for the below content, nor does Snowflake warrant that the below content is accurate.

For more information on creating and running Streamlit apps in Snowflake, see About Streamlit in Snowflake.

Follow the directions in Create a Streamlit app by using Snowsight to create a new Streamlit app in Snowsight.
Copy the Streamlit code from our GitHub repo into the code editor.
Replace the placeholder values with your account details.
To preview the app, select Run to update the content in the Streamlit preview pane.

Interact with the Streamlit App¶

Navigate to the Streamlit app in your browser or the Streamlit in Snowflake preview pane.
Start asking questions about your data in natural language (e.g. “What questions can I ask?”).

Create a standalone Streamlit app¶

You can also use the example code to build a standalone app.

Note

This is shown only as an example. Snowflake does not provide support for the below content, nor does Snowflake warrant that the below content is accurate.

Install Streamlit.
Create a Python file locally called analyst_api.py.
Copy the Streamlit code from our GitHub repo into the file.
Replace the placeholder values with your account details.
Run the Streamlit app using streamlit run analyst_api.py.

The database and schema specified in the code is the stage location for the semantic model YAML file. The role used in the Snowflake connector should have access to underlying data defined in semantic model.

For a more comprehensive implementation, see the Cortex Analyst advanced SiS demo in the Snowflake Samples GitHub repository. This repository is configured with various pre-built features that make deploying Cortex Analyst seamless and robust.

Disable Cortex Analyst functionality¶

If you do not want Cortex Analyst to be available in your account, disable the feature by changing the ENABLE_CORTEX_ANALYST parameter using the ACCOUNTADMIN role:

USE ROLE ACCOUNTADMIN;
ALTER ACCOUNT SET ENABLE_CORTEX_ANALYST = FALSE;

Copy

Parameter Type	Session
Data Type	BOOLEAN
Description	Controls whether Cortex Analyst functionality is enabled in your account.
Values	FALSE: Cortex Analyst functionality is not available. TRUE: Cortex Analyst functionality is available.
Default	TRUE

Control models used by Cortex Analyst¶

You can use model-level RBAC (role-based access control) to control access to the models used by Cortex Analyst. Each model is protected by a designated application role, and administrators can grant or revoke access to specific LLMs via these model-specific roles. For more information, see Role-based access control (RBAC).

Important

Model-level RBAC is an advanced feature intended for customers with specific regulatory or compliance requirements that dictate which models can be used and where they can be hosted. If you do not have such requirements, Snowflake recommends that you do not use this feature.

You cannot choose a model directly. Instead, Cortex Analyst assigns each request to a model, or to a combination of models, taking into account the following factors:

The models available in your Snowflake region.
The account’s cross-region inference configuration.
Any model-level RBAC restrictions you have established.

Tip

Different models produce different results. For consistent results, use the same Snowflake region, cross-region inference configuration, and model-level RBAC restrictions for all requests.

Cortex Analyst selects models in the following order of preference, using the highest-ranked model to which your role has access. If your role has access to none of these models, the request fails.

Anthropic Claude Sonnet 4
Anthropic Claude Sonnet 3.7
Anthropic Claude Sonnet 3.5
OpenAI GPT 4.1
Combination of Mistral Large 2 and Llama 3.1 70b

Cortex Analyst’s model selection behavior may change from time to time to take advantage of advances in model functionality.

Risks and limitations¶

Cortex Analyst relies upon the availability at least one supported model configuration. Disabling specific models reduces fallback options and increases the risk of query failures.

Model-level restrictions apply to all Cortex features that can use the model; it is not possible to restrict access to a model only in Cortex Analyst or in any other single Cortex feature.

Cost considerations¶

The credit rate usage for Cortex Analyst is based on the number of messages processed as outlined in the Snowflake Service Consumption Table. Only successful responses (HTTP 200) are counted. The number of tokens in each message only affects cost when Cortex Analyst is invoked using Cortex Agents. Otherwise, the number of tokens in each message does not affect cost.

Note

The above charges cover AI costs for text-to-SQL. Additional warehouse costs apply when you execute the SQL generated by Cortex Analyst.

Monitoring the cost of Cortex Analyst¶

To view credit consumption for Cortex Analyst, use the CORTEX_ANALYST_USAGE_HISTORY view. For example:

SELECT * FROM SNOWFLAKE.ACCOUNT_USAGE.CORTEX_ANALYST_USAGE_HISTORY;

Copy

Usage of Cortex Analyst also appears in the METERING_HISTORY view in the ACCOUNT_USAGE schema with a service type of AI_SERVICES.

Legal notices¶

Where your configuration of Cortex Analyst uses a model provided on the Model and Service Flow-down Terms, your use of that model is further subject to the terms for that model on that page.

The data classification of inputs and outputs are as set forth in the following table.

Input data classification	Output data classification	Designation
Usage Data	Output (SQL query suggestion): Usage Data Query result (using SQL query suggestion): Customer Data	Covered AI Features [1]

Input data classification

Output data classification

Designation

Usage Data

Output (SQL query suggestion): Usage Data

Query result (using SQL query suggestion): Customer Data

Covered AI Features [1]

For additional information, refer to Snowflake AI and ML.